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PREFACE 


Anyone  who  has  worked  in  education  or  mass  communica¬ 
tion,  knows  how  hard  it  is  to  change  men's  minds  or  arouse 
their  interest.  A  classic  study  of  the  problem  was  done  in 
Cincinnati  on  a  United  Nations  Association  campaign;  Dr.  Kramer 
examines  it  in  this  monograph.  But  there  are  many  other  such 
studies.  Voting  by  Bernard  Berelson,  Paul  Lazarsfeld,  and 
William  McPhee  established  that  few  people  are  converted  by 
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a  Presidential  campaign.  Studies  of  psychological  warfare , 
on  which  we  shall  comment  more  fully  below,  chow  too  that 
defectors  are  won  only  from  among  already  alienated  troops. 

The  notion  that  "Hidden  Persuaders"  can  by  a  few  devious 
tricks  manipulate  large  publics  by  mass  media  has  long  since 
been  exposed  as  a  myth,  the  widespread  survival  of  which 
illustrates  once  more  how  hard  it  is  for  information  to  change 
presuppositions . 

There  is  one  good  reason  why  this  myth  survives  and 
that  is  that  up  to  now  we  have  had  no  good  means  of  measuring 
the  interaction  of  stimulus  and  response  in  ordinary  communica¬ 
tions  situations.  We  have  had  no  way  of  measuring  how  much 
communication  input  produces  what  communication  output.  The 
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most  elementary  kind  of  effectiveness  measurement  has  been 
unavailable.  It  is  that  problem  which  Dr.  Kramer  addresses 
in.  this  monograph. 

In  this  preface,  let  us  look  first  at  what  the 
the  special  problems  of  measurement  are  that  make  it  hard 
to  estimate  cause-effect  relations  in  communication  situations. 
Then  we  shall  consider  the  resulting  myth  that  exaggerates 
the  power  of  psychological  operations  and  the  countermyth  that 
denigrates  them.  Finally,  let  us  note  the  possibilities 
offered  for  making  better  estimates  of  the  quantitative  rela¬ 
tionship  between  stimulus  and  response  by  simulation  of  mass 
media  processes. 

Measuring  Stimulus-Response  Relations 

In  a  psychological  laboratory  one  can  measure  how  far 

varying  the  amounts  of  what  is  communicated  varies  the  response 

of  the  subject  —  usually  a  college  student  hired  for  the  job. 

To  cite  just  one  example  among  hundreds  in  the  large  literature 

of  experimental  social  psychology,  Whittaker  has  shown  that  the 

largest  amount  of  attitude  change  occurred  in  his  sibjects  when 

the  propaganda  they  received  was  neither  very  similar  to  what 
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they  already  believed,  nor  very  different,  but  intermediate. 

In  the  laboratory  one  can  establish  such  quantitative  relation¬ 
ships  of  cause  and  effect.  One  can  measure  the  quantity  and  the 


Jaimes  0.  Whittaker,  "Attitude  Change  and  Communication- 
Attitude  Discrepancy,"  Journal  of  Social  Psychology,  65  (1965), 
pp.  141-147. 
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characteristics  of  the  communication  to  which  the  subject  is 
exposed  and,  by  psychometric  devices,  the  attitude  or  cogni¬ 
tions  of  the  subject  before  and  after.  The  finding  of  such 
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experiments,  as  Hovland  has  noted,  is  almost  always  that  somer 
measurable  change  occurs  as  a  consequ<  :nce  of  communication. 

In  the  field,  on  the  other  hand,  as  Hovland  pointed 
out,  the  usual  finding  is  one  of  no  attributable  change  following 
communication.  Often  from  the  be fore -measure  to  the  after¬ 
measure  there  is  no  change  at  all.  More  often  the  world  has 
changed  between  the  two  observations  and  so  have  some  attitudes, 
but  not  in  a  way  readily  attributable  to  the  stimulus  communica¬ 
tion.  Too  many  intervening  variables  block  change  or  make  it 
random  to  permit  any  causal  conclusion  to  be  drawn.  Among  the 
intervening  variables  are  the  processes  studied  by  Lazarsfeld 
and  emphasized  by  Hovland  in  his  classic  essay  comparing 
laboratory  and  field  results,  namely  the  freedom  left  to  the 
audience  members  in  the  real  world  to  select  for  attention 
only  those  communications  that  are  compatible  with  their  pre¬ 
vious  views  and  which  will  therefore  not  impel  change  in  them. 

But  the  selective  processes  are  not  all  encompassing.  People 
do  expose  themselves  to  communications  that  produce  change  in 
them.  Yet  it  has  remained  hard  for  the  researcher  to  establish 
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conclusively  the  link  between  communi cations  cause  and  atti- 
tudinal  effect.  Our  techniques  of.  observation  have  been 
inadequate  to  do  that. 

The  social  sciences  have,  however,  developed  excel¬ 
lent  techniques  for  measuring  attitudes  and  opinions  of  an 
audience  and  changes  in  them.  The  audience  end  of  the  communi¬ 
cations  process  can  be  rigorously  measured  either  in  the  labora¬ 
tory  or  in  the  field.  Public  opinion  polls,  for  example,  spot 
changes  of  a  few  percentage  points  in  what  the  public  thinks. 

And  because  we  have  a  good  technique  for  documenting  the  response 
end  of  the  continuum,  communication  research  has  tended  to 
become  audience  research.  We  rarely  study  the  full  act  of  copi- 
munication,  which  is  a  bilateral  process  between  message  and 
audience . 

The  original  Cincinnati  study  which  Dr.  Kramer  examines 
is  an  illustration  of  the  point.  The  United  Nations  Association 
planned  a  major  information  campaign  for  Cincinnati.  Wisely, 
they  decided  to  evaluate  the  effort.  They  employed  the  National 
Opinion  Research  Center  to  do  a  survey  of  opinions  and  attitudes 
about  the  U.N.  before  and  after  the  campaign.  With  many  quali¬ 
fications,  the  best  one  sentence  summary  of  the  results  is  that 
little  planned  change  occurred.  Attitudes  and  information  about 
the  U.N.  were  substantially  similar  after  to  what  they  had  been 
before . 

Note,  however,  the  research  design  of  the  Cincinnati 
study.  There  were  careful  measurements  of  audience  attitudes 
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before  and  after,  but  no  measures  of  the  totality  of  the  stimuli 
that  impinged  upon  the  audience.  The  dimensions  of  what  the 
campaign  managers  themselves  put  out  were  known,  but  not  the 
much  more  massive  product  of  the  normal  news  media.  It  was 
assumed  that  the  campaign  was  sufficiently  massive  that  some¬ 
how  it  should  have  gotten  through  to  everyone. 

Doubt  about  the  proposition  that  the  entire  public 

was  effectively  reached  lay  behind  the  Kramer  study.  We  know 

that  people  select  for  attention  a  small  percentage  of  the 

foreign  affairs  news  that  appears  in  their  papers  (13  to  22% 

* 

is  the  estimate  Kramer  uses).  We  know,  furthermore,  that  no 
medium,  much  less  message  within  it,  gets  to  more  than  a  frac¬ 
tion  of  the  total  audience.  Knowledge  of  these  facts  stirred 
the  suspicion  that  perhaps  the  reason  that  the  observed  audi¬ 
ence  changes  during  the  Cincinnati  U.N.  campaign  were  so  small 
was  that  the  selective  processes  operating  on  the  U.N.  infor¬ 
mation  output  attenuated  the  frequency  with  which  it  actually 
reached  Cincinnati  citizens  to  a  very  low  level.  Perhaps  it 
was  a  self-delusion  of  those  involved  in  the  vigorous  activities 
of  the  U.N.  information  campaign  to  believe  that  it  must  be 
visible  to  everyone.  Dr.  Kramer's  monograph  tends  to  sustain 
the  suspicion  that  frequency  of  exposure  was  not  great.  For 
example,  comparing  Kramer's  content  analysis  figures  with  cumu¬ 
lative  exposure  figures  that  result  from  the  simulation,  we  find 
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that  the  average  man  saw  or  heard  between  3  and  5  per  cent  of 
the  messages  put  out  and  the  average  woman  between  1  and  2-1/2 
per  cent.  Thus  even  though  there  were  142  stories  about  the 
campaign  sponsors,  they  reached  the  average  man  but  seven 
times  in  six  months  and  the  average  woman  four  times.  Expla¬ 
nations  of  the  U.N.  appeared  43  times  in  the  course  of  the 
campaign.  The  average  man  saw  two  of  these,  and  the  average 
woman  one.  There  were  256  stories  about  the  veto  power, 
enough  one  might  think  to  affect  the  attitude  of  any  popula¬ 
tion,  but  the  average  man  was  reached  by  such  stories  only 
nine  times  in  six  months  and  the  average  woman  six  times.  A 
news  story  or  other  tidbit  on  the  veto  once  every  three  weeks 
or  month  is  not  going  to  change  many  attitudes  in  six  months 
time  if  ever. 

To  arrive  at  estimates  such  as  those  we  have  just 
been  quoting,  one  must  first  have  means  for  measuring  what 
stimuli  actually  reached  the  average  audience  member.  We 
have  for  a  long  time  had  the  capability  of  measuring  propa¬ 
ganda  output.  Content  analysis  of  media  enables  us  to  say 
how  many  column  inches  in  newspapers ,  how  many  words  in  maga¬ 
zines,  how  many  minutes  on  radio  or  TV  is  devoted  to  any  given 
topic  such  as  the  U.N.  Content  analyses,  however,  have 
generally  been  divorced  from  audience  studies.  They  have 
therefore  had  little  to  tell  us  about  the  cause-effect  or 
communications  impact  process.  They  may  have  other  values. 


For  example,  a  remarkable  recent  content  analysis  of  news 
about  the  United  Nations  in  press  and  broadcasts  in  50 
countries,  directed  by  Dr.  Alexander  Szalai  of  UNITAR,  re¬ 
vealed  dramatically  that  the  main  U.N.  coverage  in  1968 
(as  Dr.  Kramer  found  in  Cincinnati  in  1947)  was  of  crises  and 
international  conflict,  with  but  little  attention  to  the 
developmental,  welfare,  and  other  day-to-day  activities  of 
the  organization.  However,  such  findings  have  in  the  past 
seldom  been  linked  to  measurements  of  the  amount  of  attitude 
change  resulting  from  the  greater  or  lesser  coverage  received 
by  different  topics. 

To  make  such  a  linkage  of  communication  stimulus  to 
audience  response  requires  a  more  complex  design  than  simply 
to  correlate  content,  analysis  measures  with  attitude  change 
measures.  The  reason  for  not  being  satisfied  with  such  a 
design  is  that  appe<irance  of  a  column  inch  or  minute  of 
material  in  the  media  is,  as  already  noted,  not  a  good  measure 
of  opportunity  for  audience  exposure.  Appearance  of  a  column- 
inch  at  the  tcp  of  page  one  of  a  newspaper  with  a  circulation 
of  oiie  million  is  obviously  not  the  same  thing  as  a  column  inch 
buried  inside  a  minor  local  paper.  There  are  many  factors 
besides  mere  content  volume  that  must  be  taken  into  account  if 
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one  wishes  to  measure  what  stimulus  material  reached  particular 
persons.  For  example,  two  inches  or  two  items  one  after  another 
in  the  same  paper  is  not  the  same  stimulus  as  one  each  in  two 
different  media.  The  same  N-thousand  people  see  both  items  in 
one  case,  more  people  see  less  material  apiece  in  the  other. 

Thus  a  simple  correlation  of  content  analysis  results  with  mea¬ 
sures  of  attitude  change  is  most  likely  to  produce  but  small 
explanations  of  variance,  although  with  a  strong  suspicion  that 
the  weak  relations  are  the  result  not  of  reality  but  of  having 
only  a  poor  measure  of  what  actually  came  through  to  the 
audience. 

Because  content  analysis  does  not  measure  what  comes 
through  to  the  audience,  most  researchers  interested  in 
measuring  media  impact  have  used  interviews  or  similar  audience 
measures  not  only  for  measuring  attitude  change  but  a  so  for 
measuring  exposure.  Magazines  and  newspapers  do  not  rely  on 
their  circulation  figures  alone  to  prove  their  reach,  but 
commission  audience  surveys  to  measure  readership  among  dif¬ 
ferent  kinds  of  persons  and  by  kind  of  page.  Radio  and  TV 
researchers  use  diaries ,  recording  attachments ,  simultaneous 
telephone  interviews,  and  recall  to  establish  detailed  ratings 
of  shows  at  different  times  of  day  and  by  different  kinds  of 
people . 

For  at  least  three  reasons,  however,  even  this  data, 
excellent  as  much  of  it  is,  does  not  meet  our  need  for  an 
independent  measure  of  exposure  to  relate  to  observed  responses. 
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First#  and  probably  least  important ,  is  the  fact  that  mea¬ 
suring  both  exposure  and  response  by  interview  answers  may 
result  in  confounding  of  that  which  we  wish  to  separate.  A 
person's  statement  about  what  arguments  or  messages  he  read 
or  heard  is  in  part  a  statement  about  which  arguments  or 
messages  persuaded  or  impressed  him,  rather  than  being  a  pure 
measure  of  exposure.  That  problem  is  rather  acute  in  studies 
(such  as  the  Cincinnati  one)  which  rely  for  all  their  data  on 
social  attitude  surveys  or  public  opinion  polls  thav  .  also 
are  used  to  measure  respondent  attitude.  It  is  less  acute  for 
the  rating  services  and  similar  media  studies  which  use  some¬ 
what  carefully  controlled  measures  of  exposures  such  as  a 
dummy  magazine  for  aided  recall  or  a  diary. 

A  second,  more  serious,  problem  is  that  carefully 
collected  rating  data  is  often  unavailable.  It  is  usually 
unavailable  to  the  social  science  researcher  who  cannot 
afford  a  careful  field  study  of  exposure  to  the  particular 
output  (such  as  the  U.N.  campaign)  in  which  he  is  interested. 

He  must  usually  be  satisfied  with  standard  statistics  of 
circulation  and  audience  not  pinpointed  to  his  particular  stimulus 
material.  From  those  general  media  statistics  he  must  cal¬ 
culate  estimates  of  the  audience  for  the  material  of  particu¬ 
lar  interest  to  him.  That  is  one  of  the  things  our  mass  media 
simulation  facilitates  and  which  Dr.  Kramer  did  in  the  present 
study. 


Furthermore,  for  many  purposes,  such  as  studies  of 
the  reach  of  communications  in  closed  societies ,  the  researcher, 
is  not  able  to  collect  data  in  the  field.  Dr.  Kramer's  study, 
done  in  the  United  States  where  abundant  audience  data  was 
available,  served  as  a  methodological  test  for  a  simulation 
that  in  other  studies  is  being  applied  to  measuring  communi¬ 
cation  in  the  Soviet  Union. 

A  third  problem  is  that  exposure  data  collected  by 
interview  or  other  audience  interrogation  devices  is  usually 
collected  one  medium  at  a  time.  Nielsen  ratings  exist  for 
TV  shows.  Each  major  magazine  or  newspaper  has  its  own  audi¬ 
ence  study.  It  is  more  than  a  trivial  matter  to  put  these 
together.  Suppose  we  know  that  20  million  persons  saw  a  par¬ 
ticular  TV  show  with  a  message  of  interest  and  that  20  million- 
saw  a  magazine  that  contained  the  same  message.  One  cannot 
conclude  that  40  million  were  reached,  since  some  persons  saw 
both  the  TV  show  and  the  magazine.  What  can  we  conclude  about 
how  many  persons  are  exposed  once,  twice,  or  zero  times?  As 
Dr.  Kramer  explains  in  this  monograph,  our  simulation  model 
is  designed  in  part  to  produce  estimates  of  that  kind. 

All  the  considerations  just  outlined  as  to  why  neither 
content  analysis ,  nor  audience  surveys ,  nor  ratings  alone  give 
us  am  adequate  measure  of  exposure  led  us  to  design  the  mass 
media  message  diffusion  simulation  that  is  described  and  tested 
in  this  volume.  The  purpose  of  the  simulation  program  is  to 
provide  estimates  of  who  was  reached  how  often  by 


what  messages  in  a  national  or  other  population  subjected 
to  some  information  flow.  The  inputs  are  data  about  factors 
which  must  be  taken  into  account,  including  the  volume,  fre¬ 
quency,  formats,  and  location  of  messages,  and  the  size  and 
characteristics  of  the  audience  population,  their  media  habits, 
and  special  events  (such  as  crises)  in  the  real  world  which 
may  change  their  media  habits.  The  simulation  accepts  what¬ 
ever  information  there  is  about  all  these  variables  and  then 
puts  out  estimates  of  what  kinds  of  persons  have  been  reached, 
how  often,  by  each  theme  in  the  contents.  That  is  a  stimulus 
measure  that  is,  hopefully,  more  useful  than  the  crude  content 
analysis  with  which  we  start,  and  therefore  one  that  is  more 
appropriate  to  correlate  with  attitudinal  response  data. 

That  is  what  Dr.  Kramer  has  done  for  the  Cincinnati 
U.N.  campaign.  Why  that  campaign?  That  is  a  rare  case  where 
there  exists  well  collected  before  and  after  attitude  measure¬ 
ments,  and  also  where  there  was  a  well  defined  propaganda 
campaign  that  could  be  documented  by  a  content  analysis,  and 
finally  where  all  the  necessary  population  and  media  data 
exist  so  the  content  analysis  results  and  that  data  could  be 
fed  into  the  simulation  model.  The  Cincinnati  U.N.  campaign 
was  therefore  an  ideal  instance  on  which  to  make  a  first 
trial  of  the  mass  media  message  diffusion  simulation,  to  see 
how  it  worked  and  whether  it  produces  any  insight  as  to  the 
impact  of  the  flow  of  messages  on  attitude  change. 

The  goal  toward  which  we  are  working,  one  toward 
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which  some  steps  have  been  taken  in  this  first  test,  is  to 
create  a  device  that  will  permit  us  to  do  studies  in  nature 
such  as  those  we  now  do  only  in  the  laboratory,  that  measure 
how  much  impact  propaganda  or  information  have  on  the  people 
who  receive  it. 

How  Much  Impact  Does  Communication  Have? 

We  alluded  above  to  the  myth  of  all-powerful  propa¬ 
ganda.  It  stems  in  part  from  the  claims  of  advertising  men 
and  public  relations  men  themselves,  for  they  need  to  empha¬ 
size  the  value  of  their  services.  It  also  stems  in  part  from 
World  War  I  and  the  argument  of  German  nationalists  that  they 
were  not  defeated  on  the  field  of  battle,  but  were  taken  in  by 
Allied  propaganda.  More  careful  studies  of  psychological  war¬ 
fare  experience  give  little  support  to  the  notion  that  the 
airdropped  leaflets  had  any  major  impact  in  changing  German 
minds.  What  every  careful  study  has  shown  is  that  well  timed, 
well  planned  psychological  warfare  can  be  very  effective 

indeed,  if  —  but  only  if  —  it  provides  information  to  help 
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the  audience  do  what  it  is  already  predisposed  to  do.  Janowitz 
and  Shils  in  a  classic  study  found  in  World  War  II  that  Wehr- 
macht  members  were  likely  to  respond  to  surrender  appeals  only 
to  the  extent  that  cohesion  had  already  broken  up  in  their 
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small  units  by  the  failure  of  leadership  by  the  non-coms. 

In  Viet  Nam  studies  of  Viet  Cong  returnees  to  the  government 
side  showed  that  by  1966-67  virtually  all  Viet  Cong  members 
knew  about  the  Open  Arms  program  through  airdrop  leaflets  and 
radio  broadcasts  and  knew  from  these  how  to  go  about  surren¬ 
dering,  but  that  actual  defection  occurred  only  when  family 
pressures,  or  appeals  by  former  buddies,  or  mistreatment  in 
their  Viet  Cong  unit  created  a  first-hand  personal  pressure. 

Analogous  points  could  be  made  from  the  literature 
on  advertising,  or  the  literature  on  election  campaigns,  or 
the  literature  on  adoption  of  innovations.  All  of  them  show 
no  simple  direct  relation  between  the  volume  of  mass  communi¬ 
cation  and  the  audience  response. 

There  are  two  functions  which  mass  communications  do 
serve  in  the  complex  process  of  producing  an  attitude  change 
or  action.  Mass  communications  lay  a  foundation  of  basic 
information,  creating  in  i-ha+  way  the  environment  in  which 
any  other  influences  may  act,  as,.' for  example,  enabling  tne'  Viet 
Cong  to  know  how  to  surrender  if  they  wish-  Secondly,  mass 
communications  or  any  other  kind  of  communications  may  be 
the  trigger  which  sets  off  a  reaction  the  character  of  which 
is  determined  far  more  by  the  system  that  is  triggered  than  by 
the  trigger  itself. 


In  the  literature  on  communications  effects  two  main 

models  may  be  found,  the  so-called  vector  model  and  the  so- 
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called  trigger  model.  The  vector  model  sees  the  audience's 
predisposition  as  a  force  going  in  one  direction  and  the 
communications  input  as  another  force  going  in  another  direc¬ 
tion,  and  the  net  effect  in  behavior  as  a  resultant  of  these 
two  forces. 


DIRECTION  AND  FORCE  OF  WHAT  THE 
HEARER  BELIEVED  OR  DID  AFTER 


That  vector  model  is  not  necessarily  naive.  It  does 
not  treat  propaganda  as  all-powerful,  as  do  some  popular  writers 
on  persuasion.  The  model  allows  appropriate  weight  to  the  pre¬ 
dispositions  of  the  audience.  It  takes  account  of  the  inertia 
against  change  rooted  in  the  hearer's  prejudice  and  made  effective 
by  mechanisms  of  psychological  defense.  The  vector  model  also 
encompasses  findings  as  subtle  as  the  "sleeper  effect,"  in  which 
the  fact  that  messages  come  from  disapproved  sources  is  gradually 
forgotten,  while  their  contents  come  to  be  accepted.  The  vector 
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pattern  which  expressed  that  finding  shifts  through  time,  so 
that  vectors  which  were  earlier  opposed  to  each  other  come  to 
be  seen  as  having  the  same  direction. 

But  many  phenomena  that  do  not  fit  the  vector  model 
do  fit  a  trigger  model.  A  trigger  is  something  that  starts 
a  process,  but  has  little  to  do  with  how  the  process  works. 

To  analyze  t..e  trajectory  of  a  bullet  one  needs  to  know  the 
charge  behind  it,  its  shape,  the  bore  of  the  gun,  the  direc¬ 
tion  of  aim,  but  not  anything  about  the  trigger.  The  trigger 
only  sets  off  a  process  whose  determinants  were  in  the  things 
it  touched.  Communications  can  work  that  way. 

First,  there  lies  latent  in  individuals 
a  great  collection  of  traces  of  previous  com¬ 
munications.  Any  new  communication  may  serve 
to  change  this  massive  structure  only  impercep¬ 
tibly,  but  it  may  at  the  same  time  set  it  into 
action  in  directions  determined  by  the  struc¬ 
ture  itself  more  than  by  the  trigger  stimulus. 

Second,  the  event  triggered  within  the 
system  may  itself  have  more  effect  on  the  system 
than  does  the  o_iginal  stimulus.  Arguing  for 
one's  views  in  reply  to  a  challenge  may  have 
more  effect  on  one  than  does  the  challenge. 

Third,  within  an  individual  many  latent 
attitudes  may  be  simultaneously  present.  The 
structure  of  social  controls  and  social  rela¬ 
tions  may  make  some  of  them  easier  to  express 
them  others.  Thus,  even  stimuli  which  have 
a  persuasive  effect  on  a  man's  thought  may 
trigger  quite  opposite  expressions. 

Fourth,  where  a  stimulus  is  addressed  to 
a  population  of  individuals,  structural  deter¬ 
minants  may  result  in  its  mobilizing  different 
proportions  of  those  who  agree  and  those  who 
disagree  with  it.  If  it  mobilizes  more  of  those 
who  disagree,  the  stimulus  may  boomerang.* 


Examples  of  such  trigger  effects  were  found  in  the 
foreign  trade  debate  of  1960-61  where  pro-free  trade  propaganda 
stirred  up  protectionists  to  action  more  than  it  did  those  who 
sympathized  with  the  propaganda. 

If  all  the  points  just  made  about  the  complexity  of 
the  cause-effect  relation  between  communication  and  response 
are  correct/  then  why  should  we  be  trying  to  create  a  device 
to  permit  estimates  of  correlation  between  quantity  of  exposure 
and  quantity  of  attitude  change?  There  is  every  reason  to 
expect  the  correlations  to  be  small.  Communications  will  only 
work  on  those  groups  and  in  those  situations  where  predisposi¬ 
tions  are  favorable.  For  long  periods  massive  information  may 
only  be  setting  a  latent  environment  of  knowledge  and  images. 

In  other  periods  a  small  stimulus  may  trigger  off  that  mass  of 
latent  predispositions. 

Yet  even  though  our  expectation  is  for  weak  relation¬ 
ships,  it  is  of  value  to  create  a  tool  that  enables  us  to 
measure  more  precisely  than  we  now  can  what  the  magnitude  of 
these  relationships  may  be.  In  the  first  place,  one  cannot 
dismiss  the  vector  model.  There  is  substantial  evidence  from 
laboratory  experiments  and  from  cost-effectiveness  studies  of 
advertising  that  repetition  has  its  effect.  At  least  under  some 
circumstances,  the  amount  of  persuasion  is  a  function  of  the 
amount  of  communication.  It  would  be  useful  to  know  how  often, 
how  much,  and  under  what  circumstances  that  is  the  case. 

Secondly,  even  in  trigger  situations  there  are 
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quantitative  parameters.  How  many  people  get  exposed  enough 
for  the  trigger  to  go  off?  What  kinds  of  people  are  these? 

How  much  latent  information  had  to  be  there  first,  and  when 
and  to  whom  did  it  come? 

Thus  Dr.  Kramer  approaches  the  simulation  model 
neither  with  the  naive  expectation  that  there  should  be  a  power¬ 
ful  relation  between  the  amount  of  exposure  to  U.N.  information 
and  the  amount  of  attitude  change,  nor  with  the  superficial 
notion  that  nothing  can  be  learned  by  examining  the  quantita¬ 
tive  relations  between  them.  The  mass  media  simulation  per¬ 
mits  us  to  learn  more  about  what  the  dynamics  of  the  relation¬ 
ship  are . 


The  Simulation 

We  have  already  said  enough  about  the  mass  media 
simulation  that  is  exercised  in  this  volume  so  that  the  reader 
knows  that  its  purpose  is  to  provide  an  estimate  of  the  frequency 
of  exposure  of  different  types  of  audience  members  to  different 
themes  in  the  media  content.  At  first  glance  it  is  an  extremely 
naive  simulation.  It  represents  in  the  computer  a  sample  of 
the  population  each  member  of  which  has  certain  demographic 
characteristics  and  each  member  of  which  also  has  certain  media 
habits,  such  as  subscriptions  or  probabilities  of  turning  on 
certain  broadcasts.  Then  a  scenario  is  written  which  locates 
each  occurrence  of  each  theme  in  each  issue  of  each  medium. 

Then  the  computer  passes  the  scenario  over  each  individual  in 


xviii 


the  imaginary  sample  and  keeps  a  count  of  each  probable 
exposure . 

If  there  were  no  other  problems  to  it,  that  would 
be  a  very  pedestrian,  inefficient  number  grinder.  These 
operations  could  be  done  better  by  solution  of  probability 
equations.  There  are,  however,  four  subtle  problems  that 
enter  the  simulation,  that  give  it  its  intellectual  interest, 
and  that  make  mandatory  a  simulation  approach  rather  than  an 
analytic  solution.  These  are  all  discussed  in  Dr.  Kramer's 
volume.  They  are  designated  by  the  labels  cumulation,  dupli¬ 
cation,  triggering,  and  estimation  of  full  arrays  from  incom¬ 
plete  information. 

Cumulation :  Messages  in  the  mass  media  are  events  in 
vehicles  that  appear  in  regular  patterns  over  time.  Broad¬ 
casts  are  patterned  into  daily  or  weekly  programs  on  discrete 
channels.  Print  media  appear  as  serial  issues  of  fixed  publi¬ 
cations.  The  members  of  the  audience  have  regular  habits 
regarding  these  broadcasts  or  publications.  The  significance 
of  that  fact  is  that  message  exposures  are  not  statistically 
independent  events.  If  John  Doe  subscribes  to  a  pro-U.N.  paper 
then  the  chance  of  his  reading  repeated  stories  about  the  U.N, 
is  high.  It  will  not  do  to  randomly  assign  exposures  to  stories 
about  the  U.N.  to  all  members  of  the  hypothetical  population. 
Some  of  them  have  media  hrbits  that  will  expose  them  often,  and 
some  have  habits  that  will  expose  them  seldom.  Consider  fur¬ 
ther  the  difference  between  a  scientific  journal  and  an  airdrop 


leaflet.  Virtually  all  issues  of  a  scientific  journal  go  to 
subscribers  so  that  the  probability  of  a  person  who  sees  one 
issue  also  seeing  the  next  is  perhaps  60%  or  70%  or  80%.  An 
airdrop  leaflet,  on  the  other  hand,  is  received  pretty  ran¬ 
domly.  The  chances  of  seeing  two  in  a  row  may  be  small.  Con¬ 
sider  the  implications  of  this.  Consider  two  media  both  of 
which  reach  .5  of  the  population.  In  two  issues,  a  medium 
which  always  goes  to  exactly  the  same  persons  would  reach  .5 
of  the  population,  each  exactly  twice.  Another  whose  distri¬ 
bution  was  completely  random  would  have  reached  .75  of  the 
population;  .5  of  the  population  once,  .25  twice.  Dr.  Kramer 
developed  the  problem  of  cumulation  and  a  model  for  dealing 
with  it  in  this  volume.  It  is  a  problem  which  has  been  much 
discussed  in  the  research  literature  on  advertising.  Dr. 

Kramer's  treatment  raises  the  discussion  to  a  new  level  of 
sophistication. 

Duplication :  Cumulation  is  a  problem  because  expo¬ 
sure  to  messages  are  not  independent  events  but  are  autocorrelated 
over  time.  Duplication  is  a  different  but  analogous  problem: 
Exposure  to  different  message  vehicles  is  not  independent. 

They  are  correlated  with  each  other.  For  example,  a  perron 
who  reads  the  Atlantic  is  more  likely  than  the  average  person 
to  read  the  New  Yorker,  and  less  likely  to  watch  Gunsmcke.  But 
is  the  Atlantic  reader  more  or  less  likely  to  read  Harper's? 

The  answer  is  not  obvious.  Perhaps  the  Atlantic  saturates  most 
of  its  readers'  desire  fcr  elevated  current  commentary  or  perhaps 
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that  factor  is  overridden  by  the  similar  taste  to  which  the 
two  magazines  appeal.  Whether  it  turns  out  in  fact  that  these 
magazines  are  primarily  surrogates  for  each  other  or  share  a 
joint  demand,  it  is  nonetheless  clear  that  they  are  not  statis¬ 
tically  independent.  Somenow  account  must  be  taken  of  the  fact 
that  use  of  one  medium  has  implications  for  use  of  the  other. 

Triggering :  The  scenarios  that  we  write  in  the  simu¬ 
lation  serve  to  map  out  the  occurrence  of .various  messages. 

These  messages  reach  people  as  a  function  of  their  basic  media 
habits.  But  the  messages  may  in  turn  modify  these  habits.  If 
a  person  is  exposed  to  the  message  "Soviet  troops  invade  Czech¬ 
oslovakia,  "  he  is  likely  to  rush  to  his  radio  set  and  keep  it 
on,  hearing  much  more  news  than  he  otherwise  would  have.  The 
simulation  allows  for  such  triggering  by  which  normal  media 
habits  get  changed.  Note  that  in  the  simulation  we  use  the 
word  "triggering"  in  a  sense  that  is  a  special  case  of  what  we 
above  called  the  trigger  model  of  communication.  We  use  the 
term  triggering  to  describe  one  particular  response  to  a  commu¬ 
nication  trigger,  namely  a  change  in  media  habits. 

Estimation  of  Full  Arrays  from  Incomplete  Information: 

By  now  it  will  have  occurred  to  the  reader  that  the  simulation 
Dr.  Kramer  is  using  calls  for  a  vast  amount  of  information.  We 
need  information  about  the  demographic  characteristics  of  the 
population,  their  habits  with  regard  to  every  medium,  the  details 
of  occuzrence  of  every  message,  cumulation  data,  duplication  data, 
and  a  few  other  things.  Clearly  all  that  data  never  exists. 
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One  of  the  main  design  objectives  for  this  simulation  was  that 
it  should  accept  whatever  data  does  oxist  and  estimate  from  that' 
whatever  data  is  lacking.  For  that  operation  it  was  particularly 
important  that  the  data  input  phase  of  the  simulation  be  inter¬ 
active  between  researcher  and  computer,  for  during  data  input 
and  estimation  there  may  be  many  decisions  to  be  made. 

Chapter  II  of  Dr.  Kramer's  monograph,  especially, 
describes  some  of  the  main  routines  for  estimation  of  missing 
data.  We  rely  heavily  on  a  routine  which  we  ha'*e  named  "Mostel- 
ierization"  after  the  scholar  whose  idea  we  use,  and  which  is 
designed  to  provide  estimates  of  cell  values  in  a  matrix  where- 
we  know  the  marginals  and  also  have  some  basis  for  specifying 
interactions . 

Results 

While  the  main  value  of  Dr.  Kramer's  work  is  clearly 

A 

methodological,  it  does  tell  us  some  interesting  things  about 
the  much-studied  Cincinnati  U.N.  campaign.  It  confirms  the 
suspicion  that  except  for  crisis  news ,  despite  the  massive 
efforts  of  the  United  Nations'  supporters  in  Cincinnati,  many 
citizens  heard  very  little  about  the  U.N.  during  the  campaign. 

The  study  also  shows  a  rather  clear  correlation  between 
the  themes  most  covered  in  the  news  and  the  themes  on  which 
most  change  occurred  over  the  period  studied.  The  period  was 
one  of  much  international  crisis  activity  and  during  it  fear 
of  war  increased  markedly.  On  the  other  hand,  routine  U.N. 
information  got  through  to  the  public  very  little  and  attitudes 
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and  information  about  normal  U.N.  roles  changed  but  little. 
Thus,  across  themes  we  find  a  substantial  correlation  between 
the  messages  carried  and  the  audience  response. 

The  evidence  that  public  opinion  changed  most  in 
regard  to  those  topics  on  which  it  was  r.iOst  exposed  to  mass 
media  messages  is  presented  in  the  following  table,  which 
matches  up  as  well  as  we  can  topics  covered  in  the  mass  media 
with  questions  asked  on  the  NORC  survey.  Clearly  exposure  to 
information  was  a  prerequisite  for  change  in  attitude  or 
knowledge . 

On  the  other  hand.  Dr.  Kramer's  results  show  only 
limited  correlations  across  audience  types  between  amount  of 
exposure  and  amount  of  attitude  and  information  change.  It 
is  only  partially  true  that  the  groups  most  exposed  changed 
most. 

How  can  we  reconcile  these  superficially  contradictory 
results?  To  some  degree  it  must  be  recognized  that  some  flaws 
in  the  research  design  may  be  involved.  Data  on  radio  broad¬ 
cast  content  from  1947  no  longer  existed  in  1968.  Dr.  Kramer 
therefore  postulated  that  radio  news  flashes  covered  the  same 
topics  as  did  the  newspaper  front  pages.  Thus  even  if  there 
were  significant  differences  between  the  radio  and  newspaper 
audiences,  differences  in  what  they  received  in  the  simulation 
would  be  attenuated  by  the  similarity  of  the  scenario  offered 
them.  What  could  appear  in  the  simulation  results  would  only 
be  differences  in  what  reached  the  audiences  of  the  different 
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Themes  in  order  of 
Frequency 


Number  of 
times  counted 

in  content  Per  Cent  Changing  Attitude^ 
analysis  or  Knowledge  in  NORC  Panel 


3.  Violence#  threats  to 

peace#  war  2202 

2.  Russian -American 

relations  752 

1.  U.N.  peacekeeping  595 

4.  Great  Power  discus¬ 
sion  in  U.N.  357 

5-l 

6.S  Veto  234 

7J 

9.  Campaign  sponsors  142 

111)  Satisfaction,  dissatis- 
12j  faction  with  U.N.  138 

10.  Explanations  of  U.N.  43 

8.  U.N.  and  human  rights  32 

* 

Calculated  from  pp.  410  ff.#  below. 


Expect  war  within  10 
years  . 228 

War  main  problem 

facing  U.S.  .213 

USSR  main  problem- 
facing  U.S.  .191 

More  interested  in 
U.N.  because  situa¬ 
tion  more  critical  .209 

Pessimistic  that  U.N. 
will  succeed  despite 
disagreements  . 177 

Knowledge  of  veto  .090 

Know  names  of  cam¬ 
paign  sponsors  . 059 

Dissatisfied  with 

U.N.  .027 

Knowledge  of  various 
aspects  of  U.N.  (aver¬ 
age  over  6  items)  .050 


U.N.  job  to  guarantee 
equal  rights 
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newspapers • and  between  the  reach  of  the  kind  of  major  story 
carried  on  the  front  page  and  the  kind  of  deeper  background 
material  carried  inside.  Thus  variations  in  messages  received 
that  might  correlate  with  attitude  change  across  groups  are 
weaker  than  we  would  like  them  to  be  in  an  ideal  test  and 
weaker  than  they  have  been  in  other  cases  to  which  we  have 
applied  the  simulation. 

Even  if  that  had  not  been  true,  however,  there  would 
still  be  reasons  why  across-groups  correlations  would  come  out 
weak,  reasons  that  may  well  have  reduced  whatever  residual 
correlations  the  data  structures  would  have  allowed.  Let  us 
ask  ourselves  what  it  is  that  gives  impact  to  a  theme.  Is  it 
its  absolute  frequency  or  is  it  its  frequency  as  a  percent  of 
messages  received?  Consider  the  comparison  of  an  educated 
middle  class  person  who  reads  and  listens  to  much  news  with 
an  uneducated  lower  class  person  who  has  little  interest  in 
public  affairs.  The  former  is  apt  to  have  rather  well  formed 
political  opinions;  the  latter  rather  amorphous  ones.  The 
former  reads  and  hears  much  more  news  of  all  sorts.  In  that 
total  there  will  be  more  items  about  the  U.N.  But  these  pre¬ 
sumably  have  no  more  impact  on  the  well  formed  views  of  this 
well  read  man  than  the  few  items  about  the  U.N.  that  reach 
the  more  malleable  cognitive  structure  of  the  less  informed 
man.  Indeed,  if  we  apply  the  findings  of  such  studies  as  Voting , 


we  might  expect  even  more  changeability  among  the  less  in¬ 
formed  than  among  the  well  informed.  That,  indeed,  is  what 
we  do  find  in  this  study.  Dr.  Kramer  correlated  citizen 
exposure  to  each  theme  via  the  mass  media  with  growth  of 
information  about  foreign  affairs  between  the  two  waves  of 
the  NORC  panel  survey.  (The  correlation  was  an  ecological 
correlation  across  respondent  types.)  He  used  two  measures 
of  ncreased  information:  large  increase  of  information,  i.e. 

3  t<  ~  more  correct  answers  given  to  the  five  questions  asked 
at  t.  »  end  of  the  six  months  than  at  the  beginnings  and  moder¬ 
ate  increases  of  information,  i.e.  1  or  2  more  correct  answers 
given  at  the  end  than  at  the  beginning.  Clearly  to  have  recorded 
a  large  increase  in  knowledge  a  respondent  must  have  started  out 
quite  ignorant,  at  very  best  getting  two  out  of  five  questions 
right.  It  follows  that  the  audience  types  among  whom  large 
increments  of  knowledge  occurred  were  audience  types  that 
were  not  much  involved  in  foreign  affairs.  They  consisted  of 
people  who  knew  little  because  they  read  little.  On  the  other 
hand,  moderate  increases  of  information  on  the  five  question 
test  could  be  the  result  of  high  exposure  to  media  messages. 

That  explains  what  at  first  glance  may  be  a  puzzling 
finding  in  Dr.  Kramer's  results.  There  is  a  negative  correla¬ 
tion  of  .7  across  audience  types  between  exposure  to  the  media 
and  large  increases  of  information  and  a  positive  correlation 
of  .7  between  exposure  and  moderate  increases  of  information. 
That  is  as  it  should  be.  Furthermore,  by  a  small  amount,  the 
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latter  correlation  was  highest  for  those  themes  most  fully 
covered  in  the  media:  war  and  peace,  and  US-USSR  relations. 
That  is  also  the  direction  it  should  be. 

Those  results  provide  support  for  the  realism  of  the 
simulation.  So  do  a  few  other  results  of  Dr.  Kramer's  vali¬ 
dation  effort.  Those  persons  who  habitually  use  radio  for 
news  are  by  all  evidence  less  informed  and  less  interested 
in  world  affairs  than  those  who  use  newspapers  and  meetings. 
Exposure  to  various  themes  in  the  simulc  ion  correlated  nega¬ 
tively  with  interview  mention  of  radio  as  a  news  source  and 
positively  with  mention  of  newspapers  and  meetings. 

We  note  also  that  the  six  month  period  of  the  inter¬ 
view  study  was  one  of  growing  dissatisfaction  with  the  U.N. 
due  to  international  conflict  and  growing  fear  of  war.  The 
extent  of  dissatisfaction  was,  however,  negatively  correlated 
with  exposure  to  the  news.  Better  informed  people  were  less 
volatile  in  discouragement.  In  this  instance  differential 
exposure  to  different  themes  makes  a  clear  difference.  There 
is  only  a  -.3  correlation  between  amount  of  exposure  to  the 
general  news  coverage  of  war,  peace,  and  Soviet-American  con¬ 
flict  and  d5.ssatisfaction  about  the  process  of  the  U.N. ,  but 
there  is  a  -.6  correlation  between  such  dissatisfaction  and 
exposure  to  such  matters  as  explicit  explanation  of  the  U.N. 

Finally,  Dr.  Kramer  finds  a  +.5  to  +.6  correlation 


between  exposure  to  the  various  media  messages  and  increased 
knowledge  about  the  U.N.  veto,  but  substantially  no  correlation 
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between  media  exposure  and  knowledge  about  such  other  U.N. 
matters  as  human  rights,  health,  or  international  trade  acti¬ 
vities.  It  will  be  recalled  that  the  simulation  provided 
estimates  that  the  average  male  received  about  nine  news 
messages  on  the  veto  in  the  six  months.  The  corresponding 
figure  for  the  U.N.  and  human  rights  is  one  message  in  six 
months  and  for  the  U.N.  and  health  and  trade,  to  the  nearest 
approximation,  zero.  Thus  the  simulation  results  are  once 
more  supported  as  reasonable  by  data  from  the  real  world. 
Attitude  changes  appeared  in  the  NORC  interviews  where  the 
estimated  amount  of  exposure  to  a  theme  reached  a  critical, 
even  if  quite  modest,  level.  Where  the  exposures  were  minimal, 
the  attitude  changes  were  also  minimal  or  random. 

Thus  even  those  negative  results  which  Dr.  Kramer 
found  in  his  validation  effort  are  on  second  thought  plausible. 
It  is  plausible  that  themes  which  get  so  little  attention  that 
they  disappear  in  the  noise  of  a  mass  media  system  have  no 
measurable  impact  on  attitudes.  It  is  also  plausible  that  a 
mass  media  system  which  delivers  substantially  the  same  vehicles 
of  information  to  the  whole  public,  will  produce  very  similar 
correlations  across  groups  between  attitude  changes  and  the 
frequency  of  exposure  to  any  particular  theme.  To  a  substan¬ 
tial  extent  exposure  to  any  one  theme  was  an  index  to  exposure 
to  the  highly  uniform  mass  media  system  as  a  whole,  with  all 
the  themes  it  contained.  Furthermore,  personal  influence  and 
word  of  m<  ith  communication,  which  are  not  represented  in  our 
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model,  keep  the  society  bound  together  in  relatively  high 
cohesion.  It'  real  events  cause  changes  in  the  expectation 
of  war  among  those  who  follow  the  news  closely,  these  new 
expectations  will  be  carried  to  the  less  attentive  citizens 
from  the  more  attentive  ones  by  word  of  mouth. 

We  are  not  surprised,  therefore,  that  in  a  relatively 
homogeneous  society  like  that  of  the  United  States  we  find 
that  the  news  focus  does  determine  the  issues  about  the  U.N. 
on  which  change  takes  place  and  those  on  which  it  does  not, 
but  that  changes  when  they  do  take  place  at  all,  take  place 

* 

fairly  uniformally  across  broadly  defined  demographic  groups. 

Dr.  Kramer's  findings  in  this  respect  are  highly  plausible. 

The  validation  of  a  complex  simulation  model  is  not 
easy  and  cannot  be  done  by  any  one  set  of  observations.  The 
first  test  of  the  validity  of  a  complex  model  is  the  face 
plausibility  of  its  output.  More  advanced  validity  tests  are 
prediction  or  postdiction  in  situations  where  the  models ' 
parameters  can  be  matched  to  the  real  world;  does  the  simula¬ 
tion  output  follow  a  distribution  that  is  compatible  with  our 
knowledge  of  the  real  world?  But  a  complex  simulation  must 

* 

This  finding  matches  those  in  a  study  of  changing 
attitudes  towards  violence  done  by  this  author  for  the  National 
Commission  on  the  Causes  and  Prevention  of  Violence,  "Trends  in 
Public  Opinion  About  Violence:  1937-1968,"  November  1968,  and  one 
on  attitudes  toward  the  Viet  Nam  war  done  for  the  United  Nations 
Association,  "Trends  in  American  Public  Opinion  on  Viet  Nam," 
March  1969. 
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match  the  world  on  many  observations  before  we  can  give  it  much 
credibility.  The  more  variables  there  are  in  the  model,  the 
more  test  observations  are  required.  And  there  is  no  .  fixed 
point  at  which  the  model  is  proven.  Validation  is  a  matter  of 
degree.  Dr.  Kramer  ran  a  simulation  of  the  Cincinnati  U.N. 
campaign  because  it  is  a  simulation  of  a  situation  on  which  the 
real  world  data  was  relatively  rich.  The  results  of  the  simu¬ 
lation  of  that  situation  carry  us  the  first  steps  in  validation 
of  the  simulation  model.  The  results  obtained  so  far  are  all 
at  peace  with  the  data  that  exist  from  1947.  Certainly  we 
cannot  argue  that  validation  has  been  carried  very  far  by  this 
one  set  of  tests.  It  has,  however,  been  begun.  Dr.  Kramer 
has  shown  extraordinary  care  and  methodological  dexterity,  both 
in  his  contributions  to  the  development  of  the  model,  most  of 
which  was  specified  and  programmed  by  him,  and  in  this  initial 
testing  of  it. 


Ithiel  de  Sola  Pool 
Center  for  International  Studies 
Massachusetts  Institute  of  Technology 
July  1969 


y 


ABSTRACT 

A  Computer  Simulation  of  Audience  Exposure  In  A  Mass  Media 
System:  The  United  Nations  Information  Campaign  in  Cincin¬ 
nati,  1947-1948/  by  John  Francis  Kramer 


During  six  months  of  1947-48,  several  organization? 
in  Cincinnati  conducted  a  public  information  campaign  to 
educate  the  population  of  Cincinnati  about  the  United  Nations . 
An  NORC  panel  survey  which  was  conducted  to  study  the  effects 
of  the  campaign  found  very  slight  changes  in  information  and 
opinions  about  the  U.N.,  but  large  changes  in  concern  about 
war  and  relations  with  the  U.S.S.R.  To  better  understand 
these  effects,  we  have  used  a  simulation  of  the  Cincinnati 
mass  media  system  to  predict  the  frequency  and  reach  of  the 
flow  of  messages  in  the  system  from  known  facts  about  the  popu 
lation,  its  media  habits,  and  the  messages  in  the  mass  media. 

u,  ca  about  the  population  and  its  media  habits  were 
taken  from  census  statistics  and  newspaper  and  radio  audience 
studies.  A  content  analysis  of  the  press  provided  a  set  of 
messages  relevant  to  the  attitudes  and  opinions  measured  by 
the  NORC  survey;  radio  messages  were  postulated  from  the  news¬ 
paper  messages.  The  themes  of  messages  thus  identified  by  the 


content  analysis  served  as  input  to  the  simulation. 

Before  running  the  actual  scenario,  we  ran  three 
trial  scenarios  to  test  the  plausibility  of  the  resulting 
exposures  and  the  consistency  with  which  the  model  synthe¬ 
sized  the  input  data  for  the  media  system.  We  found  that 
the  data  were  consistently  synthesized  and  that  the  expo¬ 
sure  output  wis  plausible. 

For  the  twelve  real  themes  we  described  by  sex, 
education,  and  socio-economic  status,  the  growth  and  dis¬ 
tribution  of  exposure,  the  most  and  least  exposed  population 
subgroups ,  the  .luplication  of  exposure  across  themes ,  and 
the  most  important  media  types  and  vehicles  in  producing 
exposure  to  these  themes.  We  found  that  those  themes  which 
seemed  most  closely  related  to  the  large  changes  in  the  NORC 
panel  were  also  those  themes  for  which  the  simulation  pre¬ 
dicted  the  highest  average  frequency  of  exposure  for  the 
population.  Continuing  the  analysis,  we  proposed  two  basic 
models  relating  information  or  attitude  change  to  frequency 
of  exposure  and  then  attempted  to  correlate,  across  sixteen 
population  subgroups,  the  predicted  exposure  to  the  themes 
with  changes  in  attitudes  and  information  and  recall  of  expo¬ 
sure  in  the  NORC  panel.  We  found  that  the  distribution  of 
exposure  across  the  population  subgroups  was  relatively 
constant  from  theme  to  theme  and  that  the  resulting  cor¬ 
relations  were  small  and  not  easily  interpreted.  We  concluded 
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that  any  effects  of  the  mass  media  themes  in  the  panel  at 
the  subgroup  level  were  probably  confounded  by  variations 
in  prior  exposure  and  exposure  via  informal  communications 
channels.  Finally,  we  offered  several  specific  suggestions 
for  increasing  the  usefulness  of  the  model  by  improving  the 
data  base  and  modifying  the  output  statistics  and  organization. 
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INTRODUCTION 


THE  GENERAL  MASS  MEDIA  SYSTEM 


A  mass  media  system  in  a  typical  city  might  be 
composed  of  several  newspapers ,  several  television  stations , 
five  to  ten  radio  stations,  magazines,  transit  advertising, 
billboards,  etc.  Surely  the  number  of  exposures  of  indi¬ 
viduals  to  messages  carried  in  the  mass  media  is  many  times 
as  large  as  the  population.  Often  it  is  conceivable  that 
most  of  the  media  are  carrying  a  story,  a  message,  or  a 
common  theme  simultaneously.  How  is  it,  then,  that  great 
proportions  of  a  population  hardly  ever  become  acquainted 
with  some  of  the  themes  carried  by  the  mass  media?  With 
all  the  different  audiences  characteristic  of  the  various 
vehicles  in  the  mass  media,  how  can  one  understand  and 
predict  which  segments  of  the  population  will  be  exposed 
to  given  themes  carried  in  certain  parts  of  the  mass  media? 
These  are  the  questions  which  we  have  posed  in  this  thesis 
and  which  we  attempt  to  answer  bv  using  a  computer  model  of 
the  mass  media  system.  The  simulation  can  bring  order  out 
of  the  chaos  of,  at  a  minimum,  tens  of  different  audiences 
to  the  various  newspapers,  radio  broadcasts,  television 
broadcasts,  magazines,  etc.  We  will  attempt  to  show  that 
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several  assumptions  plus  the  integration  of  several  kinds 
of  data  can  go  a  long  way  toward  explaining  why  certain 
members  and  groups  of  the  population  are  repeatedly  exposed 
to  themes  in  the  mass  media  while  other  groups  are  rarely, 
if  ever,  exposed. 

The  Possibility  of  a  Mass  Media 
Simulation 

Students  of  public  opinion  from  political  scientists 
to  Madison  Avenue  advertising  executives  have  long  wished 
to  explain  and  understand  the  flew  of  messages  and  informa¬ 
tion  in  a  given  public  or  audience.  Many  studies  have  been 
performed  since  the  beginning  of  World  War  II  on  elements 
of  this  system  of  information  and  message  flow.  Some  of 
these  studies  have  focused  on  the  channels  of  communication, 
contrasting  formal  and  informal  channels,  the  mass  media 
versus  word-of-mouth  communication.  Other  studies  have 
emphasized  the  number  and  content  of  the  messages  in  these 
systems  and  the  factors  which  control  them.  Still  other 
studies  have  attempted  to  describe  the  audiences  of  certain 
channels  and  for  given  messages  and  the  effects  which  the 
messages  have  on  these  audiences.  Occasionally  someone 
would  study  a  combination  of  these  factors  and  the  interac¬ 
tions  between  them.  These  studies  have  provided  empirically 
based  insights  into  the  workings  of  various  elements  of  the 
total  communications  systems  for  a  public,  but  until  very 
recently  we  have  not  had  the  means  to  integrate  all  the 


information  produced  into  a  description  of  the  total  working 
of  a  communication  system.  However  with  the  advent  of  high 
speed,  large  capacity  computers  it  has  become  feasible  to 
attempt  an  integration  of  many  of  the  findings  of  previous 
studies  into  a  simulation  of  a  rather  large  part  of  the 
communication  system  of  a  given  population. 

In  1961  and  1962,  the  Simulmatics  Corporation 
developed  an  early  model  of  a  mass  media  system  which  proc¬ 
essed  data  about  the  average  audiences  of  various  communi¬ 
cations  vehicles  and  the  (real  or  hypothetical)  messages 
programmed  over  these  vehicles,  and  reported  exposures  in 
various  classes  of  the  population  and  their  frequency  and 
cost.1  This  early  model  was  developed  under  the  direction 
of  Professor  Ithiel  de  Sola  Pool  of  MIT.  In  April  of 
1963  the  Advanced  Research  Projects  Agency  (ARPA)  of  the 
Defense  Department  agreed  to  fund  at  the  Center  for  Inter¬ 
national  Studies  at  MIT  a  program  of  research  on  communica¬ 
tion  in  several  European  and  Asian  communist  countries 
under  the  direction  of  Professor  Pool.  Included  in  this 
project  was  an  attempt  to  build  a  computer  simulation  of  a 
media  system  which  might  integrate  many  of  the  various 
kinds  of  aggregate  data  gathered  in  the  research  project 
and  attempt  to  predict  from  this  aggregate  data  on  audience 

1The  model  is  reported  in  a  pamphlet  by  the  Simul¬ 
matics  Corporation  entitled  Simulmatics  Media  Mix:  Tech- 
nical  Description  (New  York:  The  Simulmatics  Corporation, 
1962)  : 


behavior  and  from  the  location,  content,  and  format  of 
messages  in  a  media  system  such  variables  as  the  types  of 
people  exposed  with  what  frequency,  the  reach  or  coverage 
of  a  given  message  or  theme  of  messages,  the  time  required 
for  the  spread  of  the  message  or  the  rate  of  penetration  of 
a  theme  or  message  in  the  population,  the  duplication  be¬ 
tween  the  various  kinds  of  messages  or  themes,  etc.  From 
that  time  to  the  present  many  social  scientists  and  graduate 
students  have  worked  in  some  degree  at  implementing  the 
simulation  part  of  this  communications  project.  Two  of  the 
algorithms  used  in  the  present  simulation  model  bear  the 
names  of  Professor  Frederick  Mosteller  of  Harvard  university 
and  Professor  Robert  Abelson  of  Yale  University  who  sug¬ 
gested  solutions  to  rather  knotty  computational  problems 
which  arose  in  developing  the  model.  The  responsibility 
for  the  final  programming  and  implementation  of  the  model 
and  the  integration  of  many  of  the  tentative  solutions 
proposed  by  the  various  people  working  on  the  model  over 
the  years,  plus  solutions  to  new  problems  which  had  not 
been  foreseen  and  a  complete  restructuring  of  the  order  of 
the  simulation  and  the  method  of  calculation  of  the  outputs 
of  the  simulation  (which  was  necessary  to  make  it  indeed 
feasible  to  run  such  a  simulation)  devolved  upon  two  grad¬ 
uate  students  at  MIT  during  1966  and  1967  under  the  direc¬ 
tion  of  Professor  Pool,  John  F»  Kramer  and  Herbert  J. 
Selesnick.  This  thesis  is  an  attempt  at  a  partial  validation 
of  the  resulting  model. 
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There  are  several  different  measures  of  the  validity 
of  a  model  (e.g.  its  "face"  plausibility,  the  amount  of  its 
unaccounted  for  variability,  etc.),  but  one  of  the  most 
stringent  and  therefore  convincing  measures  (when  success¬ 
ful)  is  the  comparison  of  the  predictions  of  the  model  with 
empirical  data.  In  order  to  so  test  the  simulation,  we 
needed  data  which  would  enable  us  to  describe  a  mass  media 
system  in  terms  of  the  parameters  of  the  model,  to  locate 
and  analyze  a  set  of  messages  in  that  system  over  a  spec¬ 
ified  time  interval,  and  to  measure  <-  ie  exposure  of  the 
population  to  these  messages  during  that  interval.  If  this 
last  criterion  was  impossible  to  meet,  we  could  measure  the 
population  changes  in  relevant  attitudes  and  information 
during  the  interval  and  attempt  to  correlate  these  with 
exposures  predicted  by  the  simulation. 

A  search  for  a  situation  in  which  all  three  kinds 
of  data  were  present  lead  us  to  the  well-known  1947-48  U.N. 
information  campaign  in  Cincinnati.  Concurrent  ;ith  th. 
campaign,  the  National  Opinion  Research  Center  (NORC)  con¬ 
ducted  in  Cincinnati  several  surveys  of  attitudes  and  in¬ 
formation  relating  to  the  United  Nations  and  other  interna¬ 
tional  news.  With  these  data  in  hand  we  looked  for  the 
other  required  data  and  for  the  most  part  have  been  able  to 
piece  together  enough  data  from  many  sources  to  describe 
the  mass  media  system  and  the  flow  of  relevant  messages  in 
it  during  the  six  months  of  the  Cincinnati  campaign. 
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Examining  closely  the  changes  reported  in  the  panel 
by  the  NORC  data,  we  found  that  the  greatest  changes  oc¬ 
curred  not  in  opinions  and  information  relating  to  the 
U.N. ,  but  in  the  aceas  of  expectations  of  war,  relations 
with  Russia,  and  control  of  the  atomic  bomb.  For  this 
reason,  a  substantial  portion  of  the  research  was  in  these 
areas,  with  the  goal  of  correlating  the  amount  of  change 
measured  in  various  subgroups  of  the  Cincinnati  population 
with  exposure  of  the  subgroups  to  relevant  messages  as 
predicted  by  the  mass  media  simulation. 

Chapter  I  presents  a  brief  summary  of  the  Cincinnati 
campaign  and  a  description  of  the  simulation  model.  In 
Chapter  II,  we  look  in  some  detail  at  one  of  the  most  im¬ 
portant  routines  in  the  simulation,  the  program  for  esti¬ 
mating  several  model  parameters  from  incomplete  data.  The 
next  five  chapters  outline  the  many  levels  of  analyzing  and 
organizing  the  input  data  for  the  simulation.  Chapters 
1II-VI  describe  the  construction  of  the  mass  media  system, 
i.e.  the  specification  of  the  model  to  the  Cincinnati  pop¬ 
ulation  and  mass  media  vehicles  as  they  were  structured  in 
1947-48.  The  content  analysis  and  description  for  input 
to  the  simulation  of  the  actual  messages  during  the  campaign 
are  presented  in  Chapter  VII.  Finally  in  Chapter  VIII  we 
discuss  tha  simulation  output  from  six  trial  and  twelve 
real  themes  of  messages.  We  find  that  the  simulation  does 
consistently  synthesize  and  reproduce  the  input  data  and 


that  it  produces  plausible  exposure  values  for  the  real 
themes  of  messages.  We  conclude  with  an  evaluaticn  of  the 
simulation  attempt  and  several  suggestions  concerning  the 
strategy  of  simulation. 

The  simulation  model  was  originally  programmed 
entirely  for  the  Project  MAC  time-sharinq  system  on  the  IBM 
7094  computer.  Interaction  between  the  computer  and  the 
researcher  is  greatly  facilitated  by  this  system;  often 
the  researcher  can  type  in  his  data,  immediately  view  the 
consequences,  and  accordingly  modify  the  data  if  desired 
The  current  simulation  model  allows  up  to  64  media  vehicles, 
with  a  maximum  simulated  population  of  3000  persons,  distri¬ 
buted  across  a  maximum  of  400  population  subgroups.  The 
last  link  of  the  simulation,  which  perforins  the  actual  pro¬ 
cessing  of  the  messages  and  record  .ng  of  exposures,  is  pre¬ 
sently  being  reprogrammed  for  the  t-tch-processing  IBM 
360-65.  The  larger  capacity  of  this  machine  ana  increased 
processing  speed  will  allow  more  detail  and  easier  handling 
of  the  output  statistics. 
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CHAPTER  I 

SIMULATING  EXPOSURE  TO  INTERNATIONAL  AFFAIRS 

IN  CINCINNATI 

The  Cincinnati  U.N.  Campaign 

During  the  six-month  period  from  September  1947  to 

March  1948,  the  American  Association  fo’  the  United  Nations 

and  the  United  Nations  Association  of  Cincinnati  conducted 

one  of  the  best  documented  and  most  conspicuous  failures  of 

a  mass  educational  campaign  to  be  found  in  the  literature 

1 

of  public  opinion  and  propaganda.  "In  an  effort  to  stim¬ 
ulate  interest  and  convey  information  about  the  United 

Nations  [the]  two  organizations  literally  bombarded  the 

2 

city  of  Cincinnati"  with  information  in  an  intense  cam¬ 
paign  whose  "objective  was  to  reach  in  one  way  or  another 
every  adult  among  1,155,703  residents  in  Cincinnati's 

3 

retail  trading  zone."  The  newspapers  played  up  United 


Reported  in  Shirley  A.  Star  and  Helen  MacGill 
Hughes,  "Report  on  an  Educational  Campaign:  The  Cincinnati 
Plan  for  the  United  Nations,"  American  Journal  of  Sociology, 
January  1950,  pp.  389-400. 

2 

Raymond  A.  Bauer  and  Alice  H.  Bauer,  "America, 

Mass  Society  and  Mass  Media,"  The  Journal  of  Social  Issues, 
Vol.  XVI,  No.  3,  1960,  p.  12. 

3Star,  op.  cit. ,  p.  393. 
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Nations  news  and  information;  for  one  week  one  hundred  and 
fifty  spot  announcements  were  broadcast  over  one  of  the 
radio  stations.  "In  all,  59,588  pieces  of  literature  were 
distriouted  and  2,800  clubs  were  reached  by  speakers  sup¬ 
plied  by  a  speakers*  bureau  and  by  circular,  hundreds  of 

4 

documentary  films  were  shown  ..." 

The  magnitude  of  the  failure  of  the  campaign  to 
stimulate  interest  in  the  U.N.  is  well  documented  by  two 
NORC  surveys  conducted  at  the  beginning  and  at  the  end  of 
six-months  period.5  With  respect  to  the  level  of  informa¬ 
tion  "the  before  and  after  scores  remained  remarkably  con¬ 
stant;  for  example,  in  September,  34  percent  said  they  had 
heard  of  the  United  Nations*  veto  power  and  7  percent  could 
explain  how  it  worked;  in  March  these  figures  were  almost 
unchanged — 37  percent  and  7  percent."6 

During  the  six  months,  however,  there  was  a  violent 
conflict  raging  in  Palestine  between  Arabs  and  Jews,  and  in 

4 Ibid. ,  p.  392. 

5 

These  surveys  include  a  panel  of  745  persons  (of 
which  20%  were  lost  or  refused  to  be  reinterviewed)  plus  a 
second  sample  of  758  adults  interviewed  in  March.  The  ques¬ 
tions  included  measures  of  information,  interest,  attitude 
toward  the  United  Nations  and  general  international  affairs 
news  and  for  the  Mar  h  interviews  measures  of  exposure  to 
the  information  and  the  channels  by  which  the  respondents 
were  exposed.  Full  details  of  the  surveys  appear  in  Cinci¬ 
nnati  Looks  at  the  United  Nations  and  Cincinnati  Looks 
Again,  being  Reports  Nos.  37  and  37A  o£  the  National  Opin- 
lon  Research  Center,  University  of  Chicago,  Chicago,  1948. 

6Star,  op.  cit.,  p.  392. 
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the  U.N.  somewhat  less  violent  conflict  over  the  partition 
of  Palestine.  East  and  West  were  regularly  denouncing  each 
other,  not  only  over  the  issue  of  Palestine,  but  also  over 
the  use  or  misuse  of  the  United  Nations  in  relation  to  the 
issue.  We  shall  see  in  Chapter  VII  that  these  were  the 
themes  predominant  in  the  mass  media  and  it  is  therefore 
not  surprising  that  the  large  changes  in  public  opinion 
were  in  this  area.  For  example,  in  September,  1947 ,  24 
percent  of  the  sample  named  war  or  peacekeeping  as  the  most 
important  problem  facing  the  country;  by  March,  1943  this 
figure  had  risen  to  45  percent.  The  percent  in  September 
expecting  war  in  the  next  ten  years  was  49  percent;  in 
March  it  was  72  percent. 

Since  our  model  is  a  model  of  the  flow  of  messages 
in  the  mass  media,  and  much  of  the  effort  of  the  information 
campaign  was  directed  into  less  formal  channels  of  communi¬ 
cation  which  we  did  not  model,  we  should  not  consider  this 
simulation  a  replication  of  the  total  information  campaign, 
but  only  of  the  relevant  messages  in  the  mass  media,  plus 
other  international  news  messages  which  relate  to  the 
larger  opinion  changes  observed  in  the  Cincinnati  panel. 
Before  we  look  further  into  the  data  relevant  to  these 
changes,  we  will  present  the  model  of  a  mass  media  system 
used  in  this  simulation. 
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Strategies  for  Simulating  a  Mass  Media 
Systen 

How  shall  we  model  a  flow  of  messages  in  a  mass 
media  system?  We  might  choose  one  of  two  strategies  for 
building  a  system  to  describe  the  flow  of  messages.  The 
first  strategy  would  make  relatively  few  strong  assumptions 
based  on  research  studies  and  intuition  about  the  message 
flow  to  develop  a  general  model,  not  completely  specific  to 
any  given  situation  and  not  strongly  tied  to  data  from  any 
particular  mass  media  system.  Such  a  model  would  be  at  a 
high  level  of  abstraction  and  because  of  its  generality  it 
would  probably  not  be  a  very  accurate  predictor  of  the 
message  flow  in  any  particular  mass  media  system,  but 
rather  a  heuristic  device  which  would  attempt  to  capture 
some  essential  part  of  the  flow  of  messages  across  many 
media  systems.  A  second  way  to  approach  the  simulation  of 
a  mass  mei  .  a  system  is  to  make  a  model  which  incorporates 
all  kinds  of  specific  data  to  model  a  particular  mass  media 
system.  In  this  kind  of  model  the  level  of  abstraction 
would  be  significantly  lower  and  the  heuristic  value,  or 
insights  into  the  functioning  of  the  system,  would  probably 
be  less,  but  the  ability  to  make  predictions  about  a  spe¬ 
cific  system  would  quite  likely  be  enhanced  because  of  the 
use  of  large  masses  of  data  from  the  particular  system.  In 
general  in  social  science  the  following  rule  seems  to  be 
true:  the  higher  the  level  of  generality  of  a  mo^l.  the 
less  useful  it  is  as  a  predictor  in  a  particular  situation. 
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This  latter  approach  is  the  approach  we  have  taken 
in  constructing  the  present  mass  media  simulation.  We  have 
attempted  to  design  a  model  in  a  set  of  computer  programs 
that  will  allow  a  researcher  to  use  as  much  data  of  various 
kinds  as  he  can  muster  pertaining  to  a  particular  mass  media 
system,  incorporating  this  data  into  the  model  to  produce 
flows  of  message  exposures  specified  throughout  the  popula¬ 
tion  for  the  particular  mass  media  system  over  time. 

What  kind  of  data  might  one  have  for  a  particular 
mass  media  system?  Typically  we  find  data  about  the  au¬ 
diences  of  the  vehicles  of  the  mass  media  system  and  how 
they  distribute  themselves  throughout  the  population.  By 
this  we  mean  that  the  audiences  are  described  in  terms  of 
their  composition  by  sex,  age,  education,  level  of  economic 
status,  race,  and  perhaps  by  other  demographic,  social, 
political,  or  public  opinion  dimensions.  These  data  come 
from  surveys  and  studies  often  commissioned  by  the  manage¬ 
ment  of  the  media  vehicles  in  a  particular  area.  Usually, 
at  least  one  of  the  newspapers  in  a  metropolitan  area  will 
cor,  luct  a  study  to  see  what  kinds  of  people  are  reached  by 
the  newspaper  versus  the  kinds  cf  people  the  competitors 
are  reaching  in  order  to  better  make  its  case  for  adver¬ 
tisers.  Of  course,  the  ratings  for  radio  and  television 
and  the  description  of  the  audiences  of  these  vehicles  by 
demographic  and  social  types  is  well  known.  Sometimes  also, 
but  very  rarely  for  daily  media  vehicles,  there  are  data 


which  describe  the  growth  of  the  audience  exposed  at  least 
once  to  the  communication  vehicle  after  several  issues  of 
the  vehicle  have  appeared.  This  net  coverage  of  the  vehicle 
after  a  given  number  of  issues  we  call  the  cumulation  or 
cumulative  audience  of  the  vehicle  after  several  issues. 
These  data  are  particularly  important  because  they  give  an 
idea  of  a  differentiation  within  a  population  between 
people  who  are  almost  constantly  exposed  to  the  vehicle  and 
people  who  are  much  less  frequently  exposed  to  the  vehicle. 
In  addition,  we  occasionally  will  know  the  average  dupli¬ 
cated  audience  between  some  pairs  of  vehicles. 

The  data  which  we  have  described  above  represents 
the  best  possible  situation  which  one  actually  encounters 
in  attempting  to  describe  empirically  the  mass  media  system. 
We  note  that  these  data  are  aggregated  or  macro-data  of 
average  statistics  about  groups  in  the  population.  Usually 
the  data  available  for  a  system  is  much  sparser  than  the 
data  described  above.  For  some  or  many  of  the  vehicles, 
often  not  even  the  average  audiences  are  known;  these  must 
be  estimated  from  some  other  Kind  of  information,  perhaps 
by  drawing  analogies  to  other  situations  where  such  data 
are  known.  In  constructing  the  simulation  we  have  made  the 
assumption  that  some  macro-data  are  available,  but  that 
they  are  not  complete,  e.g.  that  we  will  not  have  deta  on 
the  distribution  of  the  audience  of  every  vehicle  across 
every  conceivable  population  type  and  that  we  must  have 


some  method  for  estimating  these  missing  statistics  if  we 
are  to  form  a  working  model  of  the  mass  media  system. 
Professor  Frederick  Mosteller  of  Harvard  University  has 
suggested  a  parameter  estimation  iteration  for  generating 
full  tables  from  such  partial  tables. 

Let  us  look  now  at  the  kind  of  results  or  output 
which  we  might  like  from  a  simulation  of  a  mass  media 
system.  What  uses  will  we  make  of  such  a  simulation  and 
what  kinds  of  manipulations  shall  we  attempt  with  it?  We 
should  like  to  build  a  model  or  a  black  box  which  will 
answer  the  question  of  who  gets  exposed  how  frequently  over 
what  period  of  time  via  what  channel  to  what  messages  in 
the  mass  media.  At  a  later  stage  we  may  want  at  least 
implicitly  to  go  beyond  this  to  examine  the  effect  of  these 
exposures;  at  this  time  however  we  limit  ourselves  to  the 
question  of  exposure.  We  may  want,  as  with  the  present 
research,  to  do  a  content  analysis  of  some  mass  media 
system  to  produce  a  scenario  of  real  messages  from  which 
we  may  predict  the  rates  of  exposure,  frequencies  of  ex¬ 
posure,  and  distribution  of  exposures  in  the  population  of 
that  mass  media  system.  Or  having  once  established  the 
validity  of  the  model,  we  may  want  to  undertake  studies  of 
hypothetical  sets  of  messages  in  the  media  system  to  see 
what  kind  of  exposures  we  would  expect  under  different 
conditions.  For  example,  we  might  wish  to  discover  which 
strategy  would  be  most  effective  in  exposing  those  leant 
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informed  about  some  international  issue  to  messages  explain¬ 
ing  the  nature  of  that  internationar  issue.  A  model  pro¬ 
ducing  a  record  of  exposures  in  the  manner  we  have  described 
above  could  be  quite  helpful  in  planning  such  a  campaign, 
showing  he  most  efficient  way  to  invest,  resources  to 
achieve  the  desired  kinds  of  exposure. 

How  shall  we  build  this  kind  of  model?  One  approach 
would  be  to  collect  average  exposure  data  for  each  rela¬ 
tively  homogeneous  subgroup  of  the  population  and  then  to 
aggregate  these  data  over  messages  and  subgroups  to  produce 
the  desired  exposure  information.  Let  us  consider  which 
statistics  would  be  required  for  some  relatively  homogeneous 
subgroup  in  the  population  of  the  media  system.  As  a  theme 
of  messages  appears  we  would  like  to  know  for  each -subgroup 
the  frequency  distribution  of  exposures  at  each  time  period 
of  the  simulation.  Since  data  are  usually  available  only 
by  vehicles  and  since  themes  of  messages  will  normally 
appear  in  several  vehicles,  in  order  to  produce  this  fre¬ 
quency  distribution  of  exposures,  we  would  be  forced  to 
know  for  a  particular  vehicle  the  distribution  of  frequen¬ 
cies  of  exposure  in  each  subgroup  after  the  appearance  of 
a  set  of  messages  in  the  vehicle  and  then  aggregate  over 
the  vehicles  in  which  the  messages  have  occurred.  Of 
course,  data  of  this  detail  on  the  frequency  distribution 
of  exposure  after  n  messages  in  a  vehicle  are  never  avail¬ 
able.  The  closest  approximations  are  the  cumulation  data 
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of  vehicle  (not  message)  exposures  for  the  total  population 
(not  for  subgroups) .  These  data  would  be  very  hard  to 
measure  in  principle  since  the  messages  in  the  various 
vehicles  probably  have  widely  varying  attention- commanding 
characteristics,  characteristics  partly  a  function  of  the 
vehicle,  partly  a  function  of  the  display  and  location  of 
the  message  in  the  vehicle,  partly  a  function  of  the  theme 
or  content  of  the  message,  and  all  of  these  varying  also 
with  the  characteristics  of  the  audience. 

Our  solution  to  these  complexities  has  been  to  ap- 

\ 

proach  the  phenomenon  of  exposure  as  a  sequential  probabil¬ 
ity  process.  A  given  individual  has  a  certain  probability 
of  exposure  to  a  vehicle  and  a  conditional  probability  of 
exposure  to  a  message  in  the  vehicle  depending  upon  all  the 
factors  described  above,  given  that  he  is  first  exposed  to 
the  vehicle.  The  net  result  is  that  a  person's  total  prob- 
rJjility  of  exposure  to  a  given  message  is  a  product  of  two 
probabilities;  his  probability  of  exposure  to  the  vehicle 
multiplied  by  his  probability  of  exposure  to  the  particular 
message  given  exposure  to  the  vehicle  carrying  the  message. 

Even  with  this  formulation  we  might,  for  each  homo¬ 
geneous  subgroup  of  the  population,  derive  a  probability 
distribution  for  exposure  to  each  vehicle  for  the  member  of 
that  population  subgroup.  Then  given  a  particular  message 
with  its  conditional  probability  of  exposure,  we  simply 
multiply  the  two  numbers  together  in  order  to  get  the  net 


39 


probability.  However  this  method  also  has  some  problems; 
unless  we  derive  some  very  complex  joint  probability  dis¬ 
tributions,  we  do  not  know  the  probability  of  a  person's 
being  exposed  to  one  vehicle  in  relationship  to  his  prob¬ 
ability  of  being  exposed  to  a  second  vehicle.  Moreover, 
even  if  we  could  handle  the  data  for  each  subgroup  of  the 
population  in  terms  of  a  joint  probability  distribution  of 
exposures  over  quite  a  lot  of  vehicles,  we  have  absolutely 
no  possibility  of  ever  finding  actual  data,  even  by  an 
estimation  process,  in  the  detail  needed  to  describe  these 
distributions  for  each  population  subgroup.  For  instance, 
if  they  exist  at  ail,  the  duplication  data  between  vehicles 
refer  to  the  entire  population  and  not  for  some  subgroup 
of  the  population.  Likewise,  if  cumulation  data  exist  for 
a  vehicle,  they  describe  cumulation  in  the  entire  popula¬ 
tion;  almost  never  do  they  describe  cumulation  in  several 
subgroups  of  the  population.  Finally  our  routine  for  esti¬ 
mating  missing  parameters  from  known  data  requires  a  minimum 
amount  of  known  data  to  be  confident  that  the  estimates  are 
valid;  the  data  on  cumulation  and  duplication  are  so  sparse 
that  the  validity  of  the  estimation  techniques  seems  quite 
doubtful.  However  if  we  could  achieve  duplication,  cumula¬ 
tion,  and  average  audience  data  for  each  subgroup,  then  we 
could  have  the  simulation  routines  which  now  make  probabil¬ 
ity  assignments  for  the  entire  population  call  themselves 
and  operate  on  the  individual  subgroup.  In  this  case. 
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several  hundred  simulated  individuals  could  represent  mem¬ 
bers  of  that  subgroup  and  the  resulting  statistics  would  be 
weighted  by  an  appropriate  factor  to  reflect  the  size  cf 
that  population  subgroup. 

Because  of  the  problems  above  we  have  compromised 
between  designing  a  model  which  requires  a  complete  de¬ 
scription  of  the  audience  habits  for  each  population  sub¬ 
group  and  a  model  which  operates  at  the  level  of  the  total 
population  and  ignores  completely  the  need  to  differentiate 
between  important  subgroups.  To  describe  the  distribution 
of  the  population  and  the  average  audiences  of  the  various 
communications  vehicles  over  the  population  subgroups  de¬ 
fined  by  the  dimensions  chosen  by  the  researcher,  we  shall 
use  the  parameter  estimation  techniques  on  our  initially 
incomplete  data  to  provide  these  figures  for  each  population 
subgroup-  However  we  will  use  the  considerably  sparser 
duplication  and  cumulation  data  to  generate  vehicle  exposure 
probabilities  for  the  total  population  and  then  attempt  to 
assign  these  probabilities  to  the  population  subgroups, 
consistent  with  the  estimated  subgroup  parameters.  Thus, 
since  the  data  are  so  poor,  we  make  no  attempt  to  reproduce 
on  a  cell-by-cell  basis  the  cumulation  or  duplication 
figures . 

To  summarize,  a  model  of  a  communications  system 
consists  of: 

1.  an  audience,  each  member  of  which  has: 


a.  personal  characteristics  such  as  age,  sex, 
occupation,  place  of  residence 

b.  media  habits,  such  as  subscriptions 

2.  media: 

each  of  which  appears  periodically  in  discrete 
issues,  and  each  of  which  contains  separate  stories 
of  items  in  each  of  which  there  are  certain  themes. 
Among  the  most  important  quantitiative  characteris¬ 
tics  of  a  media  system  are: 

1.  the  size  of  the  population  among  whom  the  audience 
is  found 

2.  the  breakdown  of  that  population  into  cells  defined 
by  the  personal  characteristics  of  the  individuals 

3.  the  audience  of  each  vehicle  (which  is  the  circula¬ 
tion  of  that  vehicle  times  the  number  of  persons 
exposed  to  each  exemplar  of  the  vehicle) 

4.  the  cumulation  pattern  of  each  vehicle 

5.  the  duplication  patterns  between  vehicles 

The  last  two  concepts,  cumulation  and  duplication,  need  to 
be  defined.  Cumulation  is  the  proportion  of  the  population 
that  has  been  exposed  to  a  vehicle  aft_-r  n  issues.  For 
example,  magazines  tend  to  inform  their  advertisers  as  to 
the  proportion  of  the  American  public  who  will  have  seen  at 
least  one  out  of  one,  one  out  of  two,  one  out  of  three,  or 
one  out  of  four  issues  of  the  magazine.  Duplication  is  the 
intersection  between  the  set  of  persons  exposed  to  one 
vehicle  and  the  set  of  persons  exposed  to  another.  Thus 
American  newspapers  in  multi-newspaper  cities  tend  to 


inform  their  advertisers  as  to  what  proportion  of  the  pop¬ 
ulation  see  their  newspaper  alone,  see  both  newspapers,  see 
only  the  other  newspaper,  or  see  neither.  The  extent  of 
duplication  is  the  extent  of  persons  exposed  to  both. 

The  Simulation  Model 

We  have  discussed  above  the  difficulties  in  pro¬ 
ducing  a  frequency  distribution  of  exposures  for  each 
population  type  in  such  a  way  as  to  integrate  all  the  known 
data  about  cumulation  and  duplication  for  several  media 
vehicles.  /  Our  approach  to  this  problem  of  integration  of 
data  at  the  subgroup  level  and  data  for  the  entire  popula¬ 
tion  is  to  use  a  probabilistic  model  in  which  each  member 
of  the  hypothetical  simulation  population  has  a  probability 
of  exposure  to  each  vehicle  in  the  media  system,  these 
probabilities  of  exposure  being  chosen  such  that  a)  the 
average  audience  within  the  population  subgroup  or  cell  is 
correctly  reproduced  for  each  vehicle,  b)  the  duplication 
of  audiences  across  vehicles  is  correctly  reproduced  and, 
c)  the  cumulation  in  the  entire  audience  for  each  vehicle 
is  also  correctly  reproduced. 

The  simulation  model  is  easily  conceptualized  in 
two  separate  parts;  in  the  first  part  the  hypothetical 
population  with  its  probabilities  of  exposure  to  the  commu¬ 
nications  vehicles  in  the  mass  media  is  constructed  and 
stored  on  a  disk  or  a  tape  in  a  computer.  This  hypothetical 
population  consists  of  a  correct  proportion  of  individuals 


of  each  population  type  with  each  person's  probabilities  of 
exposure  to  each  of  the  vehicles  in  the  mass  media  system. 
The  second  part  of  the  simulation  takes  as  input  a  scenario 
of  messages  organized  by  themes  and  time  periods ,  which  are 
then  put  into  the  mass  media  system  to  produce  exposure  of 
various  kinds  of  individuals  over  time.  However,  this  is 
not  as  simple  as  it  may  seem  at  first.  Since  the  probabil¬ 
ities  which  have  been  stored  for  each  person  are  only  the 
likelihoods  of  exposure  to  the  vehicle  and  are  not  likeli¬ 
hoods  of  exposure  to  the  message  contained  in  the  vehicle, 
we  must  then  estimate  the  conditional  probabilities  of  ex¬ 
posure  to  the  message  given  exposure  to  the  vehicle.  The 
parameters  for  calculating  these  probabilities  of  exposure 
to  the  messages  given  exposure  to  the  vehicles  are  another 
input  in  the  second  part  of  the  simulation. 

The  First  Stage  of  the  Simulation:  Modeling  the  Vehicle 
Audiences 

Now  we  will  describe  the  sequence  of  operations 
used  in  constructing  the  hypothetical  population  with  its 
probabilities  of  exposures  to  each  of  the  vehicles  in  the 
media  system. 

Breaking  the  population  into  subgroups  with  homo¬ 
geneous  characteristics . — We  start  with  a  population  of  a 
certain  size.  The  first  step  is  to  pick  a  set  of  attributes 
or  dimensions  which  seem  to  explain  much  of  the  audience 
behavior  in  the  mass  media  and  then  describe  the  breakdown 
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of  the  population  in  terms  of  these  attributes.  Since  we 
also  want  to  describe  the  vehicle  audiences  along  these 
dimensions,  the  inputs  in  this  stage  are  usually  census¬ 
like  tables  by  age,  sex,  literacy,  region,  education,  etc. 
From  these  tables  describing  the  breakdown  of  the  population 
we  construct  a  grand  array  of  cells  which  is  the  intersec¬ 
tion  of  all  the  classes  of  the  attributes,  giving  the  num¬ 
ber  of  people  in  the  population  for  every  attribute  combi¬ 
nation.  Often  we  arrive  at  a  problem  here:  the  tables  in 
the  various  data  sources  which  describe  the  population 
breakdowns  rarely  will  include  in  one  table  all  the  dimen¬ 
sions  which  we  wish  to  consider  in  the  simulation.  We 
usually  find  tables  of  three  or  four  dimensions,  at  most. 
This  is  true  for  U.S.  census  tables  and  even  more  so  for 
foreign  census  data.  We  cure  forced  to  construct  a  popula¬ 
tion  based  on  these  tables,  taking  all  the  information 
present,  and  deriving  a  reasonable  population  which  might 
have  given  rise  to  the  tables.  For  this,  we  use  an  itera¬ 
tive  technique  suggested  by  Professor  Frederick  Mosteller 
of  Harvard  which  takes  the  known  data  and  from  it  estimates 
the  unknown  parameters  (cell  sizes)  for  the  table.  The 
resulting  values  are  those  which  contain  all  the  information 
in  the  known  tables,  but  no  other  information.  They  assume 
that  other  than  the  information  in  the  known  subtables,  all 
interactions,  or  nonrandom  effects,  are  zero  in  the  popula¬ 
tion.  Thus,  even  with  incomplete  tables  describing  the 


population,  we  are  able  to  make  a  reasonable  guess  about 
the  distribution  of  the  population  across  all  of  our  rel¬ 
evant  dimensions  and  create  a  table  which  includes  all  the 
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data  from  the  original  tables. 

Distributing  the  audience  over  the  population 
cells. — The  next  step  is  to  perform  exactly  the  same  manip¬ 
ulations  on  the  audience  for  each  of  the  vehicles,  distri¬ 
buting  it  over  the  population  cells.  Fo-  each  cell  describ¬ 
ing  a  combination  of  attributes  along  all  the  dimensions  of 
the  population,  we  have  the  number  of  the  population  in  that 
cell  and  the  audience  of  each  vehicle  in  tliat  cell.  If  we 
divide  the  audience  by  the  population  in  the  cell,  we  then 
obtain  for  each  vehicle  the  rating,  or  the  proportion  of 
the  population  of  that  cell  which  is  in  the  average  audience 
of  the  venicle.  At  this  stage,  then,  we  have  the  population 
scattered  into  cells  plus  cell  mean  exposures,  or  ratings, 
for  each  vehicle  for  the  cell.  At  this  point  we  are  ready 
to  consider  probabilities  of  exposure  to  the  vehicles  for 
each  member  of  the  population. 

Modeling  the  cumulation  process. — If  we  knew  just 
the  total  population  and  the  total  audience  for  the  vehicle, 
then  the  model  with  the  least  differentiation  would  assign 


A  more  complete  description  of  the  parameter  esti¬ 
mation  technique  is  provided  in  Chapter  II, 


every  individual  in  the  population  a  probability  of  exposure 
to  the  vehicle  equal  to  the  ratio  of  the  vehicle  audience 
to  the  total  population.  (Thus  if  the  vehicle  audience  for 
a  newspaper  reached  one-third  of  a  population,  then  this 
model  would  assign  to  each  member  of  the  population  a  prob¬ 
ability  of  exposure  to  that  vehicle  equal  to  one-third.) 

This  model  would  indeed  reproduce  the  average  audience  of 
the  vehicle,  but  it  would  not  usually  reproduce  the  other 
data  describing  how  the  audience  for  the  vehicle  varies 
over  the  types  of  the  population  or  its  duplication  with 
other  vehicles  or  its  cumulation  over  several  issues.  To 
use  the  average  audience  data  known  for  each  population 
type,  we  could  take  the  cell  mean  auc.ience  as  a  mean  prob¬ 
ability  of  exposure  for  each  population  type.  Therefore 
instead  of  one  single  probability  for  each  individual,  we 
would  have  as  many  probabilities  as  we  have  population 
types,  assigning  to  each  member  of  the  population  type  the 
average  probability  for  that  population  type.  This  is 
analogous  to  the  first  situation,  except  that  now  the  pop¬ 
ulation  is  differentiated  into  a  large  number  of  subgroups, 
each  group  having  an  average  probability  of  exposure  which 
generates  average  audiences  equal  to  those  average  audiences 
which  are  either  input  or  estimated  for  the  population  type 
for  a  given  vehicle.  However,  this  further  differentiation 
would  not  necessarily  reproduce  the  known  or  estimated 
cumulation  over  the  entire  population  for  a  single  vehicle. 
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nor  would  it  necessarily  produce  a  known  duplication  in  the 
entire  population  for  any  pair  of  vehicles.  Therefore  we 
turn  now  to  the  cumulation  data  to  calculate  probabilities 
which  will  reproduce  cumulation  for  each  vehicle  over  the 
entire  population. 

The  implications  of  cumulation  for  the  simulation 
model  are  discussed  more  fully  in  Chapter  IV  and  so  we  give 
only  a  brief  outline  at  this  point.  If  the  average  audience 
of  a  vehicle  is  one-third  of  the  population,  we  can  conceive 
of  this  happening  in  many  different  ways.  Each  member  of 
the  population  could  have  a  probability  of  one-third  of 
being  exposed  to  the  vehicle  every  time  the  vehicle  appears. 
In  this  case  the  compos ition  of  the  audience  varies  randomly 
from  issue  to  issue  and  the  total  number  exposed  at  least 
once  to  the  vehicle  (the  cumulative  audience}  grows  with 
each  succeeding  issue.  However  there  is  another  way  to 
make  the  vehicle  audience  equal  to  one-third  of  the  popula¬ 
tion:  we  could  assign  to  one-third  of  the  population  a 
vehicle  exposure  probability  of  1.0  and  to  the  rest  of  the 
population  a  probability  o.:’  0.0.  In  this  case,  every  time 
the  vehicle  appears  one-third  of  the  population  would  ba 
exposed,  but  it  would  be  the  same  one-third  each  time  and 
there  would  be  no  cumulative  growth  in  the  niraber  exposed. 
Obviously  the  cumulation  we  observe  in  the  total  audience 
is  a  function  of  the  distribution  of  exposure  probabilities 
in  the  population  and  by  manipulating  these  probabilities 
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we  can  reproduce  the  known  cumulation.  This  is  the  logic 
behind  our  use  of  cumulation  data  to  develop  probabilities. 

For  each  vehicle,  the  population  is  divided  into 
three  groups.  The  first  group  is  the  distribution  of  very 
frequent  consumers  of  the  vehicle.  These  people  might  be 
the  subscribers  to  a  magazine  or  the  hubitual  viewers  of  a 
television  show.  The  second  group  is  the  distribution  of 
those  people  who  very  infrequently  are  exposed  to  the 
vehicle.  These  people  might  be,  for  example,  all  the  men 
in  the  population  when  the  vehicle  is  a  women's  magazine, 
or  members  of  the  population  who  work  at  the  time  of  a 
given  television  program,  or  those  who  do  not  own  a  set. 

The  third  distribution  are  those  people  with  moderate  prob¬ 
abilities  of  exposure  to  the  vehicle,  i.e.,  just  the 
remainder  of  the  population. 

For  each  of  these  groups  we  must  estimate  the  size, 
the  average  audience,  and  the  two-period  cumulative  audience. 
Making  the  assumption  that  for  each  of  these  groups  the 
distribution  of  exposure  •  probabilities  is  of  a  known  func¬ 
tional  form — that  of  a  mathematical  function  known  as  the 
beta  function — then  the  average  audience  and  the  cumulation 
within  each  group  is  sufficient  to  calculate  the  parameters 
defining  each  of  the  beta  functions,  completely  specifying 
the  distribution  of  probabilities  for  the  population  for 
the  given  vehicle.  These  probabilities  are  then  such  that, 
for  the  entire  population,  they  will  reproduce  the  average 
audience  and  the  two-period  cumulation  of  the  vehicle. 


*  l-*1 


49 

In  calculating  the  distribution  of  the  probabilities 
for  a  given  vehicle,  one  probability  for  each  member  of  the 
population,  we  have  ignored  our  knowledge  of  the  distribu¬ 
tion  of  the  audience  of  the  vehicle  across  the  population 
types  and  the  resulting  average  probabilities  of  exposure 
v/ithin  each  population  type.  How  shall  we  relate  these 
many  average  probabilities  of  exposure  to  the  set  of  prob¬ 
abilities  generated  from  the  cumulation  data?  The  problem 
is  to  place  individual  probabilities  from  the  cumulation 
distribution  into  the  population  cells,  one  probability  for 
each  person  in  the  cell,  such  that  these  probabilities  re¬ 
produce  the  known  average  probability  for  the  cell.  For 
cells  that  have  a  very  high  rating,  or  mean  probability  of 
exposure,  it  is  obvious  that  the  selection  of  probabilities 
to  assign  to  these  cells  should  come  from  the  highest  ex¬ 
posure  group.  In  the  same  way,  cells  with  very  low  mean 
exposure  probabilities  should  have  probabilities  chosen 
from  the  low  probability  group.  The  actual  assignment  is 
made  in  the  following  manner:  the  mean  exposure  probability 
of  the  cell  is  compared  with  the  means  for  each  of  the 
three  distributions  and  a  distribution  is  selected  whose 
mean  is  closest  to  the  mean  of  the  cell.  A  probability  is 
then  drawn  randomly  from  this  distribution  and  assigned  to 
the  cell  and  eliminated  from  the  distribution.  This  prob¬ 
ability  will  then  account  for  part  of  the  audience  expected 
in  the  cell.  We  can  subtract  this  contribution  from  the 
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expected  audience  in  the  cell  and  calculate  a  new  mean  cell 
probability  for  the  remaining  probabilities  to  be  selected. 
This  new  mean  probability  for  the  cell  is  then  compared 
with  the  three  exposure  distribution  means  and  the  closest 
distribution  is  chosen  as  the  distribution  from  which 
another  random  selection  of  a  probability  will  be  made. 

This  procedure  continues  from  cell  to  cell  until  all  the 
probabilities  from  some  distribution  are  exhausted  and  then 
the  remaining  closest  distribution  is  used  to  draw  the 
remaining  probabilities.  In  this  way,  all  the  probabilities 
are  assigned  to  cells,  a  probability  for  each  person  in  the 
cell,  although  these  probabilities  are  not  yet  connected  to 
individual  people.  In  Chapter  VIII  we  show  that  this  semi¬ 
random  method  for  selecting  probabilities  does,  in  f'ict, 
fairly  closely  reproduce  the  required  cell  means  as  ierived 
from  the  first  part  of  the  simulation.  Of  course,  the 
approximation  is  better  for  larger  cells,  but  is  also  true 
that  larger  cells  are  more  important  in  the  analysis  of  the 
data.  Also,  since  the  data  will  rarely  be  analyzed  for 
each  of  100,  200,  or  300  cells,  but  generally  for  groups  of 
cells,  the  resulting  net  assignment  is  even  better. 

Taking  account  of  duplication  statistics. — Finally 
we  must  manipulate  these  resulting  probabilities  in  such  a 
way  so  as  to  reproduce  the  empirical  duplication  of  audi¬ 
ences  between  pairs  of  vehicles.  We  do  this  in  the  follow¬ 
ing  manner;  once  the  probabilities  have  been  assigned  to 


cells  of  the  population  we  can  calculate  the  expected  dup¬ 
lication  if  these  probabilities  for  each  vehicle  were 
randomly  assignee  to  individuals  within  the  cell.  Summing 
this  duplicated  audience  over  all  the  population  types 
gives  the  duplication  which  would  be  produced  by  the  sim¬ 
ulation  if  the  probabilities  were  randomly  assigned  within 
cells.  If  this  duplication  is  significantly  different  from 
the  empirical  duplication  then  we  must  consider  assigning 
the  probabilities  within  the  cells  on  a  non- random  basis. 

For  example,  if  the  empirical  probability  is  higher  than 
that  which  would  be  produced  by  the  simulation  using  random 
within-cell  assignment  then  we  may  want  to  increase  the 
duplication  within  each  cell  by  making  people  of  high  prob¬ 
ability  of  exposure  to  vehicle  A  also  have  high  probability 
of  exposure  to  vehicle  B.  Recalling  for  each  probability 
whether  it  comes  from  the  high,  middle,  or  low  exposure 
group  of  its  vehicle  and  also  the  average  probability  of 
each  of  these  groups,  we  calculate  the  number  of  each  pos¬ 
sible  pair  of  probability  assignments  (high-high,  high- 
medium,  high- low,  etc.)  for  any  two  vehicles  in  order  to 
most  closely  reproduce  our  empirically  required  duplication. 
These  non-random  assignments  are  made  for  pairs  of  linked 
vehicles  (vehicles  having  a  large  non-random  duplication) 
within  groups  of  up  to  five  vehicles  at  a  time  (since  dup¬ 
lication  may  be  related  across  several  vehicles).  A  more 
detailed  discussion  of  this  algorithm  follows  in  Chapter  VI 
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These  operations  take  all  the  known  data,  making 
estimates  of  unknown  data  from  the  known  data  where  re¬ 
quired,  and  integrate  the  data  via  the  probabilities  of 
exposure  for  each  individual  for  every  vehicle,  to  produce 
finally  a  set  of  simulated  individuals,  each  of  whom  belongs 
to  a  certain  population  type  as  defined  by  the  population 
dimensions  and  each  of  whom  has  a  probability  of  exposure 
to  each  vehicle  in  the  mass  media.  At  this  point,  if  we 
were  to  run  a  scenario  of  simulated  messages,  we  could 
calculate  expected  exposures  to  each  of  the  vehicles,  the 
variance  of  this  number,  the  duplication,  etc.  The  next 
step  is  to  generate  probabilities  of  exposure  to  the  mes¬ 
sages  given  exposure  to  the  vehicle.  This  process  takes 
place  in  the  second  stage  of  the  simulation. 

The  Second  Stage  of  the  Simulation: 

The  Processing  of  Scenario  Messages  and 
Reporting  of  Exposure  Statistics 

Representing  the  message  exposure  probabilities. — 

In  the  second  stage  of  the  simulation,  we  calculate  condi¬ 
tional  probabilities  of  exposure  to  the  message  given 
exposure  to  the  vehicle  in  order  that  our  exposure  statis¬ 
tics  represent  exposure  to  messages  and  themes  of  messages 
rather  than  exposures  to  vehicles  carrying  the  messages. 
Conceivably,  these  conditional  probabilities  of  exposure 
to  a  message  can  depend  upon  all  of  the  various  factors 
having  to  do  with  exposure  in  the  simulation.  They  may  be 
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a  function  of  the  population  type  of  the  individual,  of  the 
content  or  theme  of  the  message,  of  the  communication 
vehicle  in  which  the  message  is  carried,  of  the  time  period, 
of  the  history  of  previous  exposure  of  the  individual,  etc. 
This  could  mean  then,  that  for  every  message  in  the  scenario, 
we  have  as  many  different  message  exposure  probabilities  as 
there  are  population  types,  which  in  the  present  simulation 
would  be  144  message  exposure  probabilities  for  each  message. 
This  number  of  message  exposure  probabilities  for  each 
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message  is  impossible  to  handle  on  the  present  computer. 
However,  there  is  a  more  important  reason  why  we  do  not 
admit  all  these  possible  message  exposure  probabilities: 
we  simply  do  not  have  the  data  available  to  specify  for 
every  population  type  a  different  message  exposure  probabil¬ 
ity.  The  knowledge  we  have  of  conditional  exposure  prob¬ 
abilities  is  very  meager  indeed,  e.g.,  we  can  differentiate 
average  exposure  levels  for  broad  categories  of  themes  such 
as  international  news  in  general  but  not  between  different 
kinds  of  international  news.  For  the  broad  category  of 


■  Note  also  that  we  are  talking  only  of  the  average 
probability  of  exposure  in  each  population  type;  we  are  not 
taking  into  account  cumulative  conditional  message  exposure 
probabilities  which  would  give  a  distribution  of  probabil¬ 
ities  of  exposure  within  each  type.  If  we  actually  devel¬ 
oped  these  distributions  for  each  message,  we  would  generate 
a  number  of  probabilities  equal  to  the  size  of  the  popula¬ 
tion,  as  we  did  in  the  first  part  of  the  simulation.  In 
this  case,  however,  instead  of  a  number  of  such  distribu¬ 
tions  equal  to  the  number  of  vehicles,  we  could  have  hun¬ 
dreds  of  such  distributions,  one  for  each  message. 
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international  news  we  find  some  indication  of  exposure 
differences  between  people  arranged  on  a  single  dimension, 
i.e. ,  sex,  education,  age,  etc.,  but  not  all  the  possible 
combinations.  Also,  the  available  data  are  all  derived 
from  newspaper  readership,  but  do  not  distinguish  between 
newspapers,  nor  do  tney  distinguish  any  other  media 
vehicles . 

Given  the  kind  of  data  available  and  the  limited 
capacity  for  processing  and  storage  in  the  present  model, 
we  have  compromised  in  the  amount  of  detail  of  the  message 
exposure  probabilities  and  used  the  following  model;  first, 
we  assume  that  there  is  some  intrinsic  level  of  attention 
for  each  of  the  themes  in  the  scenario.  This  attention 
level  is  an  average  message  exposure  probability  ever  all 
possible  formats,  media  vehicles,  time  periods,  population 
types,  etc.,  and  the  researcher  is  required  to  specify  this 
average  level  of  exposure  for  a  theme.  We  know  from  re¬ 
search  that  we  can  make  such  statements  as,  "Twenty  percent 
of  the  people  who  read  newspapers  regularly,  read  interna¬ 
tional  news  in  the  newspapers,"  and  this  kind  of  figure  is 
what  we  mean  by  the  average  exposure  to  a  theme,  given 
exposure  to  the  vehicle. 

Now  consider  any  dimension  which  we  have  reason  to 
believe  influences  exposure  to  the  message  given  exposure 
to  the  vehicle,  e.g, ,  education.  Each  such  dimension  has 
a  number  of  classes  or  levels.  The  second  assumption  is 


that  for  a  given  vehicle,  the  ratio  for  any  two  levels  of 
the  average  probabilities  of  exposure  to  the  message  given 
exposure  to  the  vehicle  is  c  jnstant  for  all  messages  of  the 
theme.  For  example,  we  might  want  to  assume  for  any  message 
relating  to  a  theme  concerning  the  United  Nations,  that 
highly  educated  people  are  twice  as  likely  to  be  exposed 
to  the  message,  given  that  they  are  exposed  to  the  vehicle, 
as  are  poorly  educated  people  who  are  exposed  to  the  vehicle 
This  assumption  says  nothing  about  the  absolute  level  of 
exposure  to  the  message;  it  is  assumed  that  the  absolute 
level  of  exposure  to  the  message  is  governed  by  the  general 
interest  inherent  in  the  theme  and  by  the  format  of  the 
message  in  the  vehicle.  Therefore,  for  each  theme  we  must 
estimate  a  general  level  of  attention  which  is  particular 
to  the  theme  (called  PORTN  in  the  simulation)  and  ratios  of 
exposure  probabilities  for  people  in  adjacent  levels  of  the 
relevant  dimensions. 

We  use  these  ratios  as  input  because  the  researcher 
can  probably  more  accurately  estimate  a  ratio  of  two  ex¬ 
posure  probabilities  than  each  value  separately,  and  also 
because  it  is  not  possible  for  the  researcher  freely  to 
specify  both  the  average  conditional  audience  for  a  theme 
and  also  a  set  of  probabilities  of  exposure  over  a  popula¬ 
tion  dimension;  these  figures  are  related  and  must  be 
compatible.  For  example,  if  for  men  the  conditional  prob¬ 
ability  of  exposure  to  the  message,  given  exposure  to  the 


vehicle  is  0.30/  and  the  corresponding  probability  for 
women  is  0.20,  then  the  average  probability  of  exposure  or 
the  average  proportion  exposed  to  the  message,  given  ex¬ 
posure  to  the  vehicle,  lies  somewhere  between  0.20  and  0.30 
depending  upon  the  number  of  men  and  women  in  the  vehicle 
audience.  If  the  researcher  has  specified  that  for  this 
particular  theme  the  average  probability  of  exposure  for 
the  vehicle  audience  is  0.27,  then  he  is  not  simultaneously 
free  to  choose  both  the  conditional  probability  of  exposure 
for  women  and  that  for  men,  since  the  weighted  average  of 
these  probabilities  must  equal  0.27.  The  researcher  usually 
does  not  know  these  conditional  exposure  probabilities  to 
the  degree  of  accuracy  with  which  he  knows  the  overall 
level  of  exposure  for  the  theme,  and  he  doesn't  need  to  be 
bothered  with  knowing  the  breakdown  of  the  vehicle  audiences 
for  every  vehicle. 

Knowing  the  average  proportion  of  the  vehicle  au¬ 
dience  exposed  to  a  given  theme,  the  ratios  of  exposure 
probabilities  for  those  population  dimensions  chosen  by  the 
researcher,  and  the  distribution  of  the  vehicle  audience 
along  those  dimensions,  we  can  calculate  for  each  vehicle 
the  theme's  message  audience  distributed  across  each  of  the 
dimensions.  The  distribution  across  the  population  sub¬ 
groups  defined  by  the  set  of  dimensions  can  be  calculated 
either  by  cross-products  or  using  the  Mosteller  iteration: 
if  we  use  the  iteration  we  can  distribute  the  theme's 


57 


message  audience  biased  somewhat  according  to  either  the 

vehicle  audience  or  the  population  distribution  in  the 
9 

subgroups.  Finally,  the  ratio  of  the  message  audience  to 
the  vehicle  audience  in  a  subgroup  is,  for  that  subgroup, 
the  average  conditional  message  exposure  probability,  for 
that  theme  and  vehicle. 

How  then  do  we  account  for  differences  in  exposure 
from  message  message  within  a  theme?  Each  message  in 
the  theme  is  described  by  a  format  factor,  a  positive  num¬ 
ber  by  which  we  multiply  the  value  of  the  average  audience 
of  the  theme  in  order  to  get  the  audience  for  the  particular 
message.  Thus,  the  average  proportion  of  the  ahicle  au¬ 
dience  in  the  message  audience  for  the  theme  (PORTN)  is  the 
proportion  for  a  message  with  a  format  factor  of  1.0.  Mes¬ 
sages  which  are  quite  prominantly  displayed  with  pictures, 
headlines,  etc.,  have  format  factors  greater  than  1.0  and 
messages  which  are  buried  inconspicuously  somewhere  in  the 
newspaper  have  format  factors  less  than  1.0.  If  we  were  to 
do  a  study  to  assign  format  factors  to  messages,  we  would 
try  to  develop  from  the  formating  for  each  message  a  multi¬ 
plicative  factor  by  which  we  could  characterize  the  message 
and  predict  the  exposure  to  it. 

Of  course,  this  model  is  a  simplification.  For 
instance,  we  have  reason  to  believe  that  the  exposure  to 

9 

This  technique  is  discussed  in  Chapter  II. 


general  international  affairs  news  is  about  1.8  times  as 
high  among  men  given  exposure  to  the  vehicle  as  it  is  among 
women;  therefore,  for  international  affairs  themes  we  use 
this  probability  ratio  for  every  message.  However,  it 
might  very  well  be  that  the  ratio  changes,  depending  upon 
the  format  of  the  message,  for  example  whether  the  message 
is  on  the  first  page  of  the  newspaper  or  in  the  women's 
pages.  We  do  not  take  account  of  this  possible  interaction 
between  format  and  probability  ratios  except  insofar  as  the 
averages  resulting  from  the  numbers  of  messages  on  the 
front  page  and  the  numbers  of  messages  in  the  women's  pages 
give  an  average  level  of  exposure  to  the  theme.  Neverthe¬ 
less  we  will  consider  this  model  sufficient  since  we  actually 
have  only  poor  data  with  which  to  estimate  the  exposure 
probabilities  for  the  messages.  Since,  in  the  present 
simulation,  the  probabilities  are  themselves  derived  from 
a  non-random  sample  by  a  regression  procedure  which  explains 
only  40  to  50  percent  of  the  variance,  we  feel  that  the 
model  is  sufficiently  complex  for  the  level  of  validity  of 
the  data  and  that  the  errors  introduced  will  generally  be 
small  since  the  mean  exposure  will  be  fairly  accurately 
represented . 

Thus,  the  data  required  for  the  last  stage  of  the 
simulation  consists  of  a  scenario  of  messages  organized  by 
theme  and  into  time  periods,  each  message  of  which  carries 
the  identification  number  of  a  communication  vehicle  in 
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effect ,  the  simulation  has  a  dev ion  which  allows  the  re¬ 
searcher  to  change  (increase  or  decrease)  either  the  vehicle 
exposure  or  the  message  exposure  probability  or  both,  based 
either  upon  previous  exposure  to  some  theme  or  upon  the 
time  period.  We  allow  the  researcher  to  specify  as  many  as 
three  themes,  called  "trigger"  themes,  exposure  to  which 
(a  given  minimum  number  of  times)  changes  the  probability 
of  exposure  to  other  themes  in  subsequent  time  periods. 

In  order  to  use  this  facility  the  researcher  must 
specify  which  of  the  first  three  themes  are  trigger  themes 
and  a  number  of  exposures  to  a  trigger  theme  which  will 
trigger  increased  exposure  to  other  themes.  This  triggering 
may  be  based  on  past  exposure  to  any  of  the  trigger  themes 
and/or  the  current  theme  and/or  the  time  period.  The 
amount  of  the  change  in  each  of  the  probabilities  may  be  a 
function  of  the  media  type  of  the  message,  of  the  time 
period,  or  of  the  population  type  of  the  person  being  ex¬ 
posed.  The  direction  and  the  magnitude  of  .ne  change  of 
the  probability  is  specified  by  the  researcher  and  includes 
a  parabolic  ceiling  and/or  floor  effect  which  prevents 
probabilities  from  becoming  greater  than  1.0  or  less  than 
0.0. 

The  modification  of  probabilities  based  on  time 


periods  is  intended  to  allow  the  researcher  to  account  for 
exogenous  variables  such  as  an  initial  exposure  to  a  message 
via  word-of-mouth  communication.  The  change  in 
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probabilities  (the  "triggering"  effect)  can  be  reversed 
slowly  or  abruptly  as  desired  by  the  researcher.  In  fact 
it  is  possible  to  impose  up  to  three  cycles  of  triggering 
in  the  simulation  run.  Si ice  the  themes  of  the  present 
simulation  concern  common  day-to-day  news  events  we  have 
not  used  the  triggering  effects  in  our  simulation  runs. 


Processing  the  messages  and  outputting  audience 
exposure. — At  the  output  stage  of  the  simulation,  each 
individual's  probability  of  exposure  to  each  message  is 
calculated.  As  originally  conceived,  this  probability 
would  be  used  in  a  Monte  Carlo  routine  to  determine  whether 
an  exposure  had  taken  place  for  the  given  person  and  mes¬ 
sage,  resulting  in  survey-like  records  of  exposure.  As 


i 

I  with  survey  data,  these  records  would  then  be  aggregated  in 

many  different  kinds  of  tables.  However,  with  this  method 
of  producing  exposure  tables,  it  is  necessary  to  perform 
(at  a  minimum)  several  runs  of  the  simulation  and  average 
the  results  to  get  some  indication  of  the  expected  outcomes 
and  the  range  of  possible  outcomes.  Since  a  typical  sim¬ 
ulation  (involving  up  to  3,000  persons  and  1,000  or  so 
messages  over  10  time  periods)  requires  several  hours  of 
computer  time,  it  is  quite  costly  to  use  the  Monte  Carlo 
solution  to  the  simulation. 

Fortunately,  there  are  at  least  two  other  techniques 


for  generating  or  (more  accurately)  describing  simulation 
outcomes.  These  techniques  are  doubly  advantageous  in  that 
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not  only  do  they  require  only  one  run  of  the  simulation, 
but  also  they  produce  more  information  about  the  likely 
outcomes  of  the  simulation.  The  first  of  these  techniques 
calculates,  for  each  of  the  possible  outcomes  of  the  sim¬ 
ulation,  the  probability  of  that  particular  outcome.  Thus, 
we  might  have  for  each  individual  the  probability  of  his 
being  exposed  exactly  once,  twice,  three  times,  four  times, 
etc.  to  the  third  theme  of  the  simulation  by  the  fourth 
time  period.  This  kind  of  information  is  especially  useful 
in  testing  the  event  validity  of  the  simulation,  i.e.,  in 
testing  the  simulation  against  real  data.  For  example,  if 
we  had  data  demonstrating  that  a  particular  member  of  the 
NORC  survey  panel  was  exposed  three  times  to  the  second 
theme  by  the  fourth  time  period  and  the  simulation  predicted 
a  relatively  high  probability  of  exactly  three  exposures 
for  the  person,  then  we  would  increase  our  confidence  in 
the  validity  of  the  simulation. 

Most  simulations  cure  programmed  to  report  only  a 
single  outcome  even  though  probability  estimations  are 
explicit  or  implicit  in  the  model,  e.g.  an  event  is  re¬ 
ported  as  surely  happening  when  a  threshold  of  some  index 
or  scale  is  passed.  This  kind  of  reporting  makes  validation 
somewhat  difficult  since  it  does  not  include  reports  on  the 
likelihood  of  other  outcome  states  of  the  simulation.  In 
terms  of  the  example  above,  if  of  a  large  number  of  pos¬ 
sible  exposures,  the  simulation  gave  our  person  a  highest 
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probability  of  two  exposures  and  a  second  highest  probabil¬ 
ity  of  three  exposures,  we  would  lose  much  information  if  it 
only  reported  two  exposures:  we  might,  if  given  full  in¬ 
formation,  consider  the  simulation  to  have  significant 
validity.  Thus,  knowing  the  probabilities  associated  with 
each  of  the  outcome  states  is  essential  for  assessing  the 
degree  of  validity  of  the  simulation.1^ 

Describing  the  outcome  states  of  the  simulation  and 
their  associated  probabilities  is  the  most  complete  descrip¬ 
tion  possible  of  the  simulation  outcomes.  However  ofcen  it 
requires  much  computation  to  arrive  at  the  probabilities 
for  each  of  the  states.  In  the  present  simulation,  the 
computation  becomes  unmanageable  because  of  the  essentially 
aggregate  nature  of  the  outcomes.  It  makes  uo  sense  to 
identify  particular  individuals  in  the  simulation  population 
with  particular  individuals  in  the  NORC  sample  panel  with 
which  the  simulation  is  to  be  validated,  because  the  whole 
modeling  of  the  simulation  has  taken  place  in  terms  of 
subgroups  or  cells  of  people,  all  of  whom  are  taken  as 


°This  approach  to  validation  of  complex  models  is 
described  in  some  detail-including  such  notions  as  the  de¬ 
scriptive  power  as  contrasted  with  the  predictive  power  of 
a  model-in  three  papers  by  Joseph  F.  Hanna.  The  first  of 
these  papers  is  entitled  "A  New  Approach  to  the  Formulation 
and  Testing  of  Learning  Models"  published  in  Synthese ,  Vol¬ 
ume  16,  1966,  pages  344-380.  The  second  is  Some  Information 
Measures  for  Testing  Stochastic  Models  (Michigan  State 
University:  mimeo,  1967) ,  and  the  third  is  Information- 
Theoretic  Techniques  for  Evaluating  Simulation  Models 
(Michigan  State  University:  mimeo,  1^6$). 
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equivalent  within  each  cell.  Thus,  the  simulation  data 
which  can  be  meaningfully  compared  with  the  real  world 
panel  are  aggregate  statistics  for  the  cells  of  the  sim¬ 
ulation.  An  example  of  such  a  statistic  might  be  the  prob¬ 
ability  that  between  25  to  35  percent  of  the  fifth  cell  of 
the  simulation  population  is  exposed  to  exactly  0,  1,  2,  3, 
4,  or  5,  etc.  messages  for  a  particular  theme  and  time 
period.  This  kind  of  statistic  is  very  hard  to  compute 
exactly  because  it  involves  the  probability  distribution 
for  the  aggregate  of  people  in  the  cell  over  all  possible 
outcomes  for  a  set  of  messages. 

In  order  to  avoid  this  difficulty,  instead  of  cal¬ 
culating  the  values  of  the  probability  distribution  for  a 
cell,  we  calculate  an  expected  outcome  for  the  cell,  i.e. 
we  take  the  expectation  of  the  probability  distribution 
which  is  so  difficult  to  calculate.1^  In  this  framework 
the  previous  statistic  would  become  the  expected  number  of 
persons  exposed  exactly  once  in  the  fifth  cell  of  the  pop¬ 
ulation.  However,  at  this  point,  we  must  acknowledge  a 
I  difficulty  in  the  use  of  this  kind  of  a  statistic  in  the 


It  often  happens  that  the  calculation  of  a  prob¬ 
ability  distribution  is  very  difficult  and  time  consuming, 
ox  requires  an  inordinate  amount  of  space  in  the  computer, 
but  that  the  calculation  of  certain  parameters  such  as  the 
expected  value  is  quite  feasible  and  straightforward.  For 
a  simple  illustration  of  this  situation,  see  the  short 
article  by  Ronald  A.  Howard  entitled  "Stochastic  Process 
Models  of  Consumer  Behavior"  in  the  Journal  of  Advertising 
Research,  Volume  4,  1966,  pages  35-42. 
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present  simulation:  our  expected  values  are  not  quite 
•  actually  expected  values.  In  the  first  stage  of  the  sim¬ 

ulation  a  random  process  was  used  to  distribute  probabil¬ 
ities  within  and  across  ce.)is.  Thus,  the  expected  values 
which  are  generated  in  the  r-econd  stage  of  the  simulation 
as  output  data  are  conditioned  upon  the  assignment  of  prob¬ 
abilities  within  and  across  cells  which  occurred  in  the 
first  stage  of  the  simulation.  There  are  constraints  upon 
the  assignment  of  probabilities  within  cells,  namely  that 
the  probabilities  must  come  clc  ;e  to  reproducing  the  cell 
means  and  that  the  probabilities  are  chosen  from  a  pre¬ 
scribed  distribution  generated  by  the  cumulation  data. 
Because  cf  these  constraints  and  because  the  expected 
values  are  almost  always  reported  for  larger  population 
subgroups  *i.e.  combinations  of  the  144  basic  cells),  we 
find  (in  Chapter  VIII)  that  the  conditional  expected  values 
are  rather  close  to  the  true  expected  values  given  all 
possible  assignments  within  the  assignment  process. 

Ideally,  with  each  expected  value  we  would  report 
another  statistic,  the  variance,  which  would  help  to  define 
the  probabilities  of  the  outcome  states  of  the  simulation. 
If  we  know  the  expected  number  of  exposures  in  a  subgroup 


12 

Unfortunately  in  programming  the  current  version 
of  the  simulation  we  did  not  realize  the  importance  of  the 
variance  here,  and  so  we  did  not  report  it.  Also  however, 
it  would  have  been  quite  difficult  to  find  storage  space 
for  twice  the  present  number  of  statistics. 
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and  the  variance  in  the  number  of  exposures  f  we  have  a 
better  idea  about  the  range  of  likely  outcomes  about  the 
expected  outcome.  It  may  also  be  true  that  most  of  the 
distributions  for  which  we  report  expected  values  approx¬ 
imate  normal  distributions,  in  which  case  the  knowledge  of 
the  expected  value  and  the  variance  would  be  sufficient  to 
describe  completely  the  probability  distribution  over  all 

of  the  possible  outcomes  of  the  simulation  for  these 
13 

statistics. 


The  Exposure  Statistics 

What  kinds  of  statistics  should  be  output  from  the 
simulation?  Because  the  processes  of  the  simulation  operate 
at  the  level  of  the  population  subgroup,  we  have  chosen  to 
report  summary  statistics  for  the  subgroup,  i.e.  the  ex¬ 
pected  number  exposed  and  the  average  expected  number  of 
exposures.  These  statistics  are  calculated  for  each  time 
period  and  also  cumulatively.  We  do  not  report  the  distri¬ 
bution  of  exposure  within  a  subgroup,  although  we  now  feel 


Since  the  messages  have  message  exposure  probabil¬ 
ities  which  differ  from  message  to  message,  the  probabilities 
are  not  constant  for  each  trial  and  the  probability  distri¬ 
bution  for  the  number  of  exposures  is  very  complex.  None¬ 
theless,  since  there  may  be  several  hundred  or  more  messages 
in  a  single  theme,  it  seems  plausible  that  the  probability 
distributions  might  very  well  approximate  normal  distribu¬ 
tions.  (The  Central  Limit  Theorem  may  be  relevant  here.) 

The  matter  of  the  approximation  to  the  normal  might  be  an 
area  of  fruitful  research  in  the  future  because  it  would 
mean  that  by  including  the  variances  as  output  in  the  sta¬ 
tistics,  we  could  quite  significantly  improve  the  descrip¬ 
tion  of  the  output  of  the  simulation. 


that  this  is  an  important  statistic.  Each  of  these  statis¬ 
tics,  however,  could  possibly  be  reported  by  a  bewildering 
array  of  other  attributes,  e.g.  by  message,  theme,  time 
period,  vehicle,  audience  type,  cumulation  distribution 
type,  or  by  combinations  of  themes  (to  get  duplicated  au¬ 
diences)  .  In  addition,  the  variance  of  each  statistic 
could  be  reported. 

A  rough  calculation,  then,  gives  a  possibility  of 
hund.  ds  of  millions  of  different  data  items  which  might  be 
of  it  est  to  the  researcher.  Of  course,  this  is  an  im¬ 
possible  number  of  data  items  to  deal  with  and,  in  fact, 
we  must  make  some  simplifying  assumptions  in  order  to  be 
able  to  offer  some  reasonable  possibilities  to  the  research¬ 
er  and  still  keep  the  program  manageable.  One  of  the  sim¬ 
plifying  assumptions  is  that  messages,  although  input  as 
individual  messages,  are  organized  by  themes,  and  in  the 
output  phase  an  individual  message  is  not  distinguished 
unless  a  particular  theme  has  only  one  message  in  it. 
Therefore,  instead  of  300  or  400  different  messages  to  be 
considered  as  output,  we  actually  consider  some  much  smaller 
number,  say  15  or  20  themes.  We  have  noted  above  that  we 
do  not  report  the  exposure  frequency  distributions  or  any 
variances.  Also,  most  statistics  are  not  output  by  64 
vehicles,  although  basic  exposure  by  vehicle  is  given.  We 
have  defined  media  types,  of  which  there  may  be  at  most 
six,  which  group  the  vehicles  together:  therefore,  the 


64  possible  vehicles  may  be  grouped  into  any  one  of  six 
media  types  and  most  statistics  are  output  by  the  six  media 
types . 

This  is  the  extent  of  the  simplifying  of  the  output 
data.  It  nevertheless  leaves  a  huge  amount  of  possibilities 
for  output,  and  the  computer  could  obviously  generate  such 
a  large  number  of  statistics  that  it  would  inundate  tne 
researcher.  Therefore,  the  researcher,  even  at  this  stage, 
is  asked  by  the  simulation  to  specify  the  tables  of  interest 
to  him.  The  most  important  choice  is  -he  population  sub¬ 
groups  for  tables.  It  seems  quite  unlikely  that  with  144 
population  types,  as  in  the  present  simulation,  a  table 
printed  out  with  144  values  could  be  comprehended  by  the 
researcher.  Therefore,  these  tables  can  be  collapsed,  and 
one  may  ask  for  any  combination  of  tables  with  any  order  of 
dimensions-  e.g.  tables  of  two  dimensions,  three  dimensions, 
four  dimensions,  and  so  on,  as  desired,  rather  than  looking 
at  the  table  of  highest  possible  dimensionality.  Below  we 
list  the  resulting  output  statistics  for  the  simulation. 

A  Summary  of  the  Simulation  Output 
Statistics 

1.  The  number  of  exposure  events  by  media  vehicle. 

2.  The  cumulative  number  of  exposure  events  by  media 
vehicle. 

3.  The  number  of  exposure  events  via  the  media  type. 

4.  The  cumulative  number  of  exposure  events  via  the 
media  type. 


5.  The  number  and  percentage  of  each  population  type 
exposed. 

6.  The  cumulative  number  and  percentage  of  each  pop¬ 
ulation  type  exposed. 

7.  The  number  and  percentage  of  each  population  type 
exposed  via  media  type. 

8.  The  cumulative  number  and  percentage  of  each  pop¬ 
ulation  type  exposed  via  media  type. 

9.  The  total  and  average  cumulative  number  of  exposure 
events  for  each  population  type. 

10.  The  total  and  average  cumulative  number  of  exposure 
events  for  each  population  type  via  media  type. 

11.  The  cumulative  number  and  percentage  of  each  pop¬ 
ulation  type  in  the  audience  of  both  the  current 
theme  and  a  trigger”  theme. 


A  Summary  Outline  of  the  Mass  Media 
Simulation 

One  common  way  of  approaching  a  piece  of  research 
in  social  science  is  to  ask  what  are  the  independent  and 
dependent  variables  treated  in  the  study.  In  addition  one 
should  ask  what  kind  of  assumptions  or  models  allow  us  to 
relate  the  values  of  the  independent  and  dependent  variables. 
The  table  of  output  statistics  above  may  be  considered  a 
description  of  the  dependent  variables  in  the  simulation; 
we  present  here  a  brief  summary  of  the  independent  variables 
and  the  assumptions  which  relate  them  to  the  dependent 
variables. 


A  Summary  of  the  Independent  Variables 

1.  The  (sometimes  partially)  known  distribution  of  the 
population  across  the  dimensions  chosen  by  the 
re-,  marcher. 
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2.  The  (sometimes  partially)  known  distributions  of 
the  media  vehicle  audiences  across  these  dimensions. 

3.  For  each  vehicle,  the  breakdown  of  the  population 
into  high,  medium,  and  low  vehicle  exposure  groups 
and  the  average  and  two-period  cumulative  audience 
for  each  group. 

4.  The  known  duplicated  audience  for  any  pair  of 
vehicles. 

5.  A  set  of  messages,  organized  by  theme  and  time 
period,  with  a  media  vehicle  identification  and  a 
format  factor  for  each  message. 

6.  For  each  theme,  the  average  proportion  of  the 
vehicle  audience  exposed  to  the  theme. 

7.  For  each  group  of  vehicles,  for  each  theme,  for 
some  subset  of  the  population  dimensions,  the 
ratios  of  conditional  message  exposure  probabilities 
of  adjacent  classes  along  each  dimension. 


Assumptions  in  the  Simulation  Model 

We  attempt  to  enumerate  here  the  most  important 

assumptions  relating  either  to  the  simulation  model,  or 

relating  to  the  data  of  this  particular  simulation  or  model 

instance,  or  combinations  of  the  two  of  these. 

1.  We  assume  that  the  combination  of  our  incomplete 
data  and  the  Mosteller  parameter  estimation  itera¬ 
tion  which  produces  maximum  likihood  estimates  of 
population  and  audience  values  in  the  cells  of  the 
simulation  is  sufficiently  accurate  for  estimating 
unknown  parameters.  This  is  equivalent  to  assuming 
that  unknown  interactions  in  the  data  can  be  neg¬ 
lected,  either  because  they  are  very  small,  or 
because  the  communication  vehicle  or  population 
subgroup  involved  is  not  of  major  importance  for 
the  particular  scenario  of  messages  whose  flow  we 
are  attempting  to  simulate. 

We  assume  that  we  can  model  the  flow  of  messages  to 
each  individual  as  a  Bernoulli  process  in  which  a) 
the  probability  of  exposure  to  one  message  by  an 
individual  is  independent  of  the  probability  of 


2. 
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exposure  to  another  message  by  the  individual,  b) 
the  probabilities  are  independent  from  individual 
to  individual,  and  c)  these  probabilities  do  not 
change  over  time  or  at  least  during  the  duration 
of  the  time  simulated  by  the  scenario  of  messages. 

3.  We  assume  that  the  three  beta  function  model  of  the 
distribution  of  these  probabilities  for  each  commu¬ 
nications  vehicle  adequately  describes  the  individ¬ 
ual  vehicle  exposure  probabilities  for  the  popula¬ 
tion. 

4.  We  assume  that  the  quasi-random  process  of  assigning 
probabilities  to  cells  from  the  cumulation  distri¬ 
bution  makes  a  realistic  integration  of  the  popula¬ 
tion  cumulation  data  and  the  cell  audience  data, 
i.e.  that  the  process  sufficiently  well  reproduces 
the  subgroup  average  and  cumulative  audiences. 

5.  We  assume  that  the  vehicle  audience  duplications 
not  accounted  for  by  the  cell  mean  exposure  prob¬ 
abilities  can  be  reproduced  by  the  non-random 

wi thin-cell  assignment. 

6.  We  assume  that  the  model  of  theme-based  conditional 
message  audiences  with  individual  message  audiences 
produced  (multiplicatively)  by  format  factors  and  a 
single  average  conditional  message  exposure  prob¬ 
ability  for  a  population  subgroup  adequately  fits  a 
wide  variety  of  empirical  exposure  data. 

Finally,  we  present  flow  diagrams  of  the  two  stages 

of  the  simulation.  In  the  first  stage,  the  simulation  may 

be  thought  of  as  a  very  efficient  data  disaggregator- 

integrator,  or  a  consistency  machine.  Such  diverse  data  as 

population  tables,  audience  and  cumulation  subtables  and 

marginals  for  each  of  several  media  vehicles,  plus  audience 

duplication  figures  between  pairs  of  media  vehicles  are 

transformed  into  probabilities  of  exposure  events  assigned 

to  members  of  a  model  population  on  the  basis  of  the  several 

assumptions  described  above.  The  flow  diagram  of  Figure  1-1 


shows  the  transformation  of  the  data.  The  most  important 
theoretical  assumptions  are  represented  by  ovals  in  the 
flow  diagram. 

The  second  stage  of  the  simulation  is  the  message 
exposure  and  reporting  stage.  Messages  are  processed  time 
period  by  time  period  and  theme  by  theme  and  the  growth  of 
exposure  is  reported. 


Population  Inputs : 

Census-like  data  including 
all  relevant  attributes, 
e.g.,  age,  sex,  literacy, 
region,  etc. 


Possible  parameter  estimation  (with  zero 
unknown  cell  interaction  assumed)  to  create 
all  crlls  of  the  population  from  incomplete 
(marg.x-tl)  data. 


The  resulting  popu¬ 
lation  "P,"  with 
attributes ,  i .  e . , 
scattered  into  cells. 


Audience  Inputs  for  Each  Vehicle 


Breakdown  or  tne  average 
audience  of  the  vehicle  by 
any  or  all  attributes  of  the 
population. 

2.  For  each  distribution  in  each 
vehicle,  the  portion  of  the 
population  in  that  distribution, 
the  average  audience,  and  the 
average  two-period  cumulation. 


Parameter  estimation  (possibly  with 
interactions  as  found  in  the  population 
"P")  of  the  audience  data  to  produce  a 
mean  probability  of  exposure  in  each 
cell  for  each  vehicle. 


The  population  "P"  with  attributes,  i.e., 
scattered  into  cells,  plus  cell  means 
(ratings)  for  each  vehicle. 


'General  assumption  of  a  Bernoulli  process 
to  model  exposure,  i.e.,  each  member  of  the 
population  has  a  fixed  probability  of  ex¬ 
posure  to  each  vehicle,  for  each  character¬ 
istic  time  period  and  independent  trials. 

Assumption  of  a  (two-parameter)  beta  function 
probability  distribution  for  each  exposure 
(distribution,  3  distributions  to  a  vehicle. 


Audience  Inputs  for  Pairs  of 


Vehicles  Where  Aval 


1.  Average  duplicated 
audience 


For  each  vehicle,  probabilities  are  produced  fcr 
each  member  of  the  population  so  as  to  reproduce 
the  average  and  two  period  cumulative  audience 
for  that  vehicle.  Then  the  probabilities  are 
assigned  to  individuals  so  as  to  reproduce  both 
cell  means  and  known  duplication  between 
vehicles. 


At  this  stage  the  population  can  be  laid  out  schematically 
in  the  following  matrix: 


I . D .  Nos . 

Various  Attri- 

Probabilities  of 

of  the 

butes  of  each 

exposure  to  each 

Popula- 

member  of  the 

vehicle.  For  each 

tion. 

| 

population. 

1  i 

f  1 

vehicle  for  exposure 
distribution  in 
which  the  person  is 
found. 

2000 

1  jL  If  If  V 

|l  1  1  i  1 

people 

f  f  1  »  1 

up  to  64  vehicles 

Figure  1-1.  The  First  Stage  of  the  Simulation:  Data  Disaggre- 
tion  and  Integration. 


Message  Audience  Inputs  for  Each  Theme; 

1.  The  proportion  of  the  vehicle  audience 
exposed  to  a  message  carrying  the  theme 

2.  For  some  of  the  population  dimensions, 
ratios  of  average  <nditional  message 
exposure  probabilities. 


Distribution  of  the  message  audience  for  each 
vehicle  over  the  subgroups  defined  by  the  di¬ 
mensions  above,  using  the  parameter  estimation 
routine . 


Average  conditional  message  exposure 
probabilities  for  the  population 
subgroups  for  each  vehicle. 


Message  Inputs  by  Theme  and  Time 
Period: 

1.  The  number  of  messages  in  each 
time  period  for  this  theme. 

2.  The  vehicle  identification  number 
for  each  message. 

3.  The  format  factor  for  each  messaap. 


The  population 
with  its  vehicle 
exposure  probabil¬ 
ities  from  the 
first  stage  of 
the  simulation. 


For  each  person  and  each  message,  calculate  the  \ 
product  of  the  message  exposure  probability,  the  j 
format  factor,  and  the  vehicle  exposure  prob¬ 
ability  to  get  the  net  probability  of  exposure. 
Then  calculate  his  cumulative  number  of  expo¬ 
sures,  cumulative  non-exposure  probability,  etc. ,  i 
record  these,  and  cumulate  output  statistics.  I 


Output  exposure  statistics  for 
this  theme  and  time  period  and  then 
go  on  to  the  next  time  period. 


Figure  1-2.  The  Second  Stage  of  the  Simulation! 

Messages  and  Reporting  Exposures. 


Processing 
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CHAPTER  II 

THE  MOSTELLER  PARAMETER  ESTIMATION  TECHNIQUE 

In  the  first  chapter,  we  discussed  several  situa¬ 
tions  in  which  we  had  to  estimate  cell  entries  in  multi¬ 
dimensional  tables  from  subtables  of  lower  dimensionality, 
e.g. ,  in  forming  the  population  and  in  distributing  the 
vehicle  audiences  and  the  message  audiences.  There  are  a 
variety  of  such  situations  in  general  in  which  the  social 
scientist  might  use  such  a  parameter  estimation  routine. 

Sometimes  the  data  available  describing  the  pop¬ 
ulation  is  from  several  different  sources,  each  one  of 
which  has  some  but  not  all  of  the  dimensions  of  interest 
to  the  researcher.  This  is  a  very  common  occurrence.  For 
instance,  in  the  case  of  the  Soviet  population,  we  have 
census  data  by  sex,  age,  residence,  but  not  by  party  affil¬ 
iation.  However,  there  are  data  available  from  other 
sources  on  the  breakdown  of  party  members  by  sex,  age, 
education,  literacy,  etc.,  and  we  might  wish  to  combine 
these  data  to  construct  a  model  of  the  Soviet  population.1 

^Herbert  J.  Selesnick  is  presently  simulating 
Soviet  exposure  to  news  of  two  international  crises  and 
has  encountered  this  problem  in  the  process  of  building 
a  simulation  population. 
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The  technique  which  we  shall  describe  below  allows  one  to 
join  these  subtables  together  to  produce  a  population  which 
reflects  all  the  interactions  in  the  known  data  and  no 
other  interactions.  Of  course,  this  is  only  possible  pro¬ 
vided  that  the  subtables  are  themselves  consistent.  We 
shall  discuss  at  some  length  this  problem  of  consistency 
or  inconsistency  of  data  from  several  sources. 

Another  kind  of  problem  for  which  one  might  use  the 
parameter  estimation  routine  is  the  problem  of  sparse  data 
in  contingency  tables,  where  the  cell  sizes  are  so  small 
that,  taken  individually,  we  can  have  relc lively  little 
confidence  that  they  are  precise  enough  estimates  of  the 
underlying  population.  One  way  to  handle  this  problem  is 
to  collapse  the  contingency  table,  either  collapsing  out 
entire  dimensions  or  collapsing  categories  within  dimen¬ 
sions,  to  provide  more  stable  estimates  of  the  resulting 
subtable  values.  If,  however,  for  purposes  of  simulation 
or  calculation  of  rates  or  whatever,  we  require  the  table 
in  its  highest  dimensionality,  this  procedure  will  not  do. 

A  solution  to  this  problem  is  to  perform  the  collapsing 
of  certain  dimensions  of  the  contingency  table  in  order  to 
get  the  more  precise  estimates  of  the  cell  values.  Then  we 
assume  no  interaction  at  the  higher  dimensionality  level 
and,  on  the  basis  of  this  assumption,  generate  expected 
values  from  the  lesser  dimensionality  tailes  in  which, 
presumably,  we  can  have  more  confidence.  This  procedure 


produces  smoothed  contingency  tables  based  on  the  assumption 
of  no  interaction  at  a  level  of  dimensionally  higher  than 
that  represented  in  the  tables  which  were  produced  by  col¬ 
lapsing  out  some  subset  of  the  original  dimensions. 

Hosteller  characterizes  this  approach  as 

.  .  .  borrow  ting]  strength  for  the  cell  estimation  from 
the  margins  by  giving  up  the  contributions  of  the 
higher  order  effects  .  .  .  This  is  not  to  deny  the 
possible  existence  of  such  [higher-order]  interactions. 
Rather  the  thought  is  that  it  may  be  profitable  for 
cell  estimation  purposes  to  ignore  them.  A  simple 
analogy  arises  in  ordinary  regression  where  we  often 
fit  a  straight  line  or  a  parabola  when  we  know  very 
well  that  the  true  state  of  affairs  is  more  complicated, 
and  sometimes  even  when  we  have  a  clear  picture  of  how 
the  true  state  of  affairs  should  be  represented.  We  do 
this  because  our  estimation  may  be  better  when  using  a 
simple  model  in  a  limited  range,  even  though  the  model 
is  wrong,  than  when  fitting  the  correct  model.2 

In  the  present  simulation,  we  use  the  parameter 
estimation  technique  in  both  of  the  ways  described  above. 

In  calculating  the  population  and  daily  newspaper  audience 
distributions,  we  use  data  from  the  NORC  sample  which  was 
too  small  (N=745)  to  provide  stable  cell  estimates  for  the 
144  population  cells.  Therefore,  we  collapse  the  five¬ 
dimensional  table  to  the  ten  three-dimensional  subtables 
from  which  we  make  "smoother"  estimates  of  the  cell  entries 
in  the  five-dimensional  table.  For  the  radio  audiences  we 
simply  use  our  estimated  subtable  values  directly  in  the 
iteration. 


2 

Frederick  Mosteller,  "Association  and  Estimation 
in  Contingency  Tables,"  Journal  of  the  American  Statistical 
Association,  63  (1968) ,  p.  19. 
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Let  us  describe  the  theoretical  foundations  for 
this  iterative  technique  for  estimating  cell  values.  First, 
we  introduce  some  notation  to  deal  with  the  problem.  In 


Fig.  II-l,  we  have  drawn  a  2x2x2,  three-dimensional  contin¬ 
gency  table  as  a  cube  with  three  faces.  The  values  in  the 
interior  cells  of  the  cube  are  labeled  with  x.  .  Here,  i 
represents  the  value  of  the  level  of  the  first  dimension, 
j  is  the  level  of  the  second  dimension,  and  k  is  the  level 
of  the  third  dimension.  If  we  sum  the  x. over  all  the 

1]JC 

levels  of  any  one  dimension,  the  resulting  values  will  be 
represented  with  a  plus  sign  {+)  at  the  place  in  the  sub¬ 
scripts  corresponding  to  the  dimension  which  has  been 
summed  out.  Summing  out  a  dimension  leaves  a  new  set  of 
values  and  if  it  is  only  one  dimension  that  is  summed  out, 
the  new  set  of  values  are  just  the  values  which  we  would 
put  on  one  of  the  faces  of  the  cube.  Thus,  the  cube  below 

has  three  faces,  x.  . ,,  x.  ,  ,  and  x,  . 

ij+  i+k'  +jk 


Fig.  II-l.  A  Representation  of  a  2x2x2  Table  as  a  Cube. 


If  we  are  willing  to  assume  that  there  are  no  third 
order  interactions,  i.e.,  that  the  only  interactions  in  the 
table  are  represented  by  the  values  on  the  faces  of  the 
cube,  then  Birch^  has  shown  that  a)  the  marginal  totals 
(the  totals  on  the  faces)  are  maximum  likelihood  estimates 
of  their  expectations,  b)  there  is  a  unique  point  for  which 
the  likelihood  function  is  a  maximum,  and  c)  the  maximum 
likelihood  estimates  for  the  interior  cell  values  are 
determined  uniquely  by  the  appropriate  marginal  totals  (by 
the  values  on  the  faces)  and  these  marginal  totals  are 
preserved.  This  means  that  if  we  know  only  the  faces  of 
the  cube  and  are  willing  to  assume  that  there  is  no  third 
order  interaction,  then  the  values  of  those  faces  under  a 
variety  of  sampling  conditions  are  the  sufficient  statistics 
for  the  likelihood  function  and  from  the  face  values  we  can 
generate  a  unique  most  likely  set  of  interior  cell  values. 
These  interior  cell  values  will  sum  appropriately  to  pre¬ 
serve.  the  known  cell  values. 

When  can  we  make  the  assumption  that  there  are  no 
interactions  other  than  the  known  interactions?  In  general, 
this  depends  upon  the  situation  for  which  we  are  using  the 
estimation  technique;  however,  if  we  have  a  choice  about 
how  many  dimensions  to  include  in  the  faces,  Birch 

3 

M.  M.  Birch,  "Maximum  likelihood  in  three-way 
contingency  tables,"  Journal  of  the  Royal  Statistical 
Society,  Series  B,  25  (1963),  220-33. 
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postulates  a  hierarchy  principle.  The  principle  implies 

that  if  a  lower  order  interaction  is  zero,  then  all  higher 

order  interactions  involving  the  same  combination  of  var- 

4 

lables  are  also  zero.  One  can  give  examples  of  strange 
contingency  tables  where  this  hierarchy  principle  is  not 
true,  but  it  seems  a  reasonable  presumption  in  the  face  of 
no  other  knowledge.  Of  course,  if  we  are  in  the  position 
of  the  researcher  who  has  gathered  together  all  the  sub¬ 
tables  he  can  about  a  population  and  is  not  smoothing  data, 
but  trying  to  generate  absolutely  unknown  data,  we  will 
usually  be  forced  to  assume  that  there  are  no  other  inter¬ 
actions.  We  must  be  satisfied  with  a  population  which 
reflects  all  the  known  data  and  nothing  more. 

The  iteration  which  calculates  interior  cell  values 
from  the  faces  proceeds  in  the  following  manner:  first  we 
initialize  the  interior  values,  usually  by  giving  them  all 
an  initial  value  of  1.0,  i.e. ,  no  interaction.  Then,  be¬ 
ginning  with  one  of  the  cells  of  the  known  subtables,  e.g. , 


4 

Mosteller’s  student  Bishop  has  investigated  the 
choice  of  lower  dimensional  tables  to  estimate  values  in 
higher  dimensional  subtables  for  the  Halothane  study.  See 
Yvonne  M.  M.  Bishop,  Multidimensional  Contingency  Tables: 
Cell  Estimates,  Ph.D."  Thesis,  Department  of  Statistics, 
Harvard  University,  Cambridge,  Massachusetts,  February, 

1967;  and  also  Chapter  4  of  J.  P.  Bunker,  W.  H.  Forrest, 

Jr.,  Frederick  Mosteller,  and  L.  D.  Vandam  (editors),  The 
National  Halothane  Study,  A  study  of  the  possible  associa¬ 
tion  between  halothane  anesthesia  and  postoperative  hepatic 
necrosis^  Report  of  the  Subcommittee  on  the  National  Halo- 
thane  Study  of  the  Committee  on  Anesthesia,  National  Academy 
of  Sciences — National  Research  Council.  In  press,  will  be 
available  from  Division  of  Medical  Sciences,  National  Re¬ 
search  Council,  2101  Constitution  Avenue,  Washington,  D.C. 
20418. 


one  of  the  cells  on  a  face  of  the  cube,  we  sum  the  approp¬ 
riate  interior  values  and  compare  the  sum  with  the  required 
cell  value  in  the  subtable.  If  the  sum  differs  from  the 
required  value,  we  multiply  each  of  the  interior  cell 
values  included  in  the  sum  by  the  ratio  of  the  required 
value  to  the  sum,  thereby  normalizing  each  of  these  in¬ 
cluded  values  so  that  they  do  sum  correctly.  In  terms  of 
our  cube  above,  if  x*.  . ,  is  the  sum  of  the  initial  interior 
values ,  then 


x!  .  ,  =  x.  +  x. 

13+  13I  132 

We  would  then  create  new  interior  values  xl^  by  multiply- 


Xijk  Xijk  *  [xij+  /  xij+  ]* 

We  proceed  in  this  fashion  until  the  sums  fc  all  the  values 
in  the  subtable  (on  the  face)  sum  correctly.  Then  we  go  on 
to  the  next  subtable  or  face.  Of  course,  as  we  process  the 
second  subtable,  we  will  change  the  interior  values  calcu¬ 
lated  from  the  first  subtable,  and  so  the  first  sums  will 
no  longer  agree  with  the  first  subtable  values.  However, 
as  we  repeat  this  process  through  the  subtables  several 
times,  the  interior  values  will  quite  quickly  converge  to 
values  which  sum  properly  for  each  subtable,  if  the  subtable 


values  are  consistent.  To  the  extent  that  the  subtable 


values  are  inconsistent,  the  interior  values  can  never 
satisfy  all  the  subtable  values.^ 

The  proof  of  convergence  of  the  values  (with  con¬ 
sistent  subtables)  and  an  interesting  interpretation  of  the 
final  values  is  provided  by  some  recent  work  in  information 
theory.  Each  of  the  sub tables  used  in  the  iteration  con¬ 
tains  a  certain  amount  of  information  (in  the  technical 
sense  of  the  word)  about  the  population.  Brown  showed  that 
the  iteration  procedure  converges  with  each  step  to  the 
".  .  .  minimum  information  (i.e.,  maximum  entropy)  exten¬ 
sion  .  .  ."of  these  subtables,  i.e.,  the  only  information 
(non-randomness)  in  the  final  distribution  is  that  con¬ 
tained  in  the  sub tables. 6 

The  minimum  information  values  are  achieved  when 
the  initialization  of  the  iteration  coxvtains  no  information, 
e.g. ,  when  all  1.0's  are  used  as  an  initial  starting  place. 
However,  it  is  possible  to  start  the  iteration  with  other 
values  which  do  contain  information,  and  thereby  achieve 
final  cell  values  which  combine  both  the  information  in  the 

5 

The  iteration  was  first  proposed  Ly  Deming  and 
Stephan  in  1940.  See  W.  Edwards  Deming,  and  Frederick  F. 
Stephan,  "On  a  least  squares  adjustment  of  a  sampled  fre¬ 
quency  table  when  the  expected  marginal  totals  are  known,” 
Annals  of  Mathematical  Statistics,  11  (1940),  427-44;  and 
W.  E.  Deming  and  F.  F.  Stephan,  ^The  sampling  procedure  of 
the  1940  population  census,"  Journal  of  the  American  Sta¬ 
tistical  Association,  35  (194$) ,  615-30. 

g 

David  T.  Brown,  "A  Note  on  Approximations  to  Dis¬ 
crete  Probability  Distributions,"  Information  and  Control, 

2  (1959),  p.  386. 
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subtables  and  that  in  the  initialization.  As  an  example, 
sometimes  we  have  data  about  a  population  which  is  helpful 
ir.\  describing  interesting  interactions,  but  which  fail  to 
agree  with  other,  more  substantial  data  wr.ich  itself  does 
not  give  evidence  about  these  interactions.  We  would  like 
to  combine  both  sets  of  data.  In  later  simulations  of 
audience  exposure  in  Communist  China,  we  shall  make  use  of 
data  from  several  different  sources.  One  source  of  data  is 
census  reports  which  describe  the  population  breakdowns 
along  several  dimensions  for  the  population  of  China.  We 
are  willing  to  make  the  assumption  that  these  census  data  are 
as  accurate  as  any  data  that  we  are  likely  to  have  and  we 
will  accept  these  data  at  face  value.  These  data,  however, 
are  never  found  in  tables  of  dimensionality  higher  than  two 
or  three  and  do  not  represent  the  interactions  between 
sex  and  literacy,  sex  and  education,  age  and  education,  etc. 
We  do  have  other  data  from  interviews  with  refugees  in  Hong 
Kong  in  which  the  interactions  between  the  various  popula¬ 
tion  dimensions  are  measured;  however,  the  marginal  values 
of  this  sample  are  not  representative  of  the  marginal  values 
for  the  population  of  China  as  a  whole.  We  would  like  to 
combine  the  interactions  of  the  sample  with  the  known 
marginals  given  in  the  census  data  for  China. 

We  can  use  the  iteration  to  produce  this  combina¬ 
tion  by  initializing  the  interior  cells  with  the  data 
describing  the  interactions  rather  than  with  all  1.0's 
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(which  imply  no  interaction) .  Then  we  use  the  known  mar¬ 
ginal  data  as  subtables  and  proceed  with  the  iteration. 

In  this  case  the  final  values  are  not  the  minimum  informa¬ 


tion  extensions  from  the  subtables,  but  combine  the  infor¬ 
mation  in  the  two  kinds  of  data.  Mosteller  offers  an 


interpretation  of  this  process : 

...  it  is  reasonable  to  think  of  the  association 
between  [two  variables]  as  independent  of  the  relative 
numbers  [in  the  classes  of  each  variable] .  Why  should 
tripling  the  number  of  A's  and  halving  the  number  of 
not- A' s  in  each  category  .  .  .  have  any  effect  upon  the 
basic  association  in  the  table?  One  reasonable  answer 
is  that  it  should  not.  We  might  think  of  a  contingency 
table  as  having  a  basic  nucleus  which  describes  its 
association  and  think  of  all  tables  formed  by  multiply¬ 
ing  elements  in  rows  and  columns  by  positive  numbers  as 
forming  an  equivalence  class  -  a  class  of  tables  with 
the  same  degree  of  association.  ...  an  index  of  asso¬ 
ciation  [for  a  2x2  table]  which  is  invariant  under 
these  row  and  column  multiplications  is  the  cross- 
product  ratio 


In  Mosteller' s  terms,  the  interviews  of  Chinese 
refugees  capture  (v/e  assume)  the  basic  nucleus  of  associa- 

O 

tion  in  the  population.  We  only  need  calculate  the 


Mosteller,  *!Association,"  p.  4.  The  italics  are 
Mosteller' s. 


We  must  be  cautious  about  this  assumption.  Mos¬ 
teller  notes  that,  "...  making  the  assumption  that  when 
a  s’obgroup  is  formed  from  a  parent  population  the  multi¬ 
plicative  invariance  is  preserved  does  not  make  it  true 
(as  a  vehicle  for  getting  an  estimate  in  ignorance,  it  has 
the  same  status  as  the  use  of  linear  regression  in  the 
absence  of  knowledge  of  the  shape  of  the  function) . " 
Mosteller,  "Association,"  p.  10. 


equivalent  table  having  the  proper  marginals  to  model  this 
9 

population. 

In  the  present  simulation  we  will  use  this  technique 
*or  calculating  the  distribution  of  the  audience  of  the 
?'  ’nday  Enquirer  over  the  population*.  We  have  data  on  the 
distribi  tion  of  the  daily  Enquirer ,  but  none  for  the  Sunday 
Enquirer ;  however,  we  know  that  the  audience  of  the  latter 
is  about  sixty-four  percent  greater  than  that  of  the  former. 
Therefore,  we  have  increased  each  of  the  subtable  values 
for  the  daily  Enquirer  by  this  value  and  used  these  values 
in  the  smoothing  operation  for  the  Sunday  Enquirer .  This 
process  may,  however,  produce  audiences  in  some  cells 
greater  than  the  population.  We  therefore  initialize  the 
iteration  with  the  population  values  to  combine  the  infor¬ 
mation  about  the  distribution  of  the  population  with  the 
audience  estimates. 


The  Consistency  of  Subtables 

In  the  process  of  using  the  parameter  estimation 
technique  at  various  stages  in  the  simulation,  we  have 
discovered  some  very  intriguing  facts  about  the  consistency 


9 

We  often  encounter  the  need  to  synthesize  data  in 
this  manner  *dien  we  are  doing  secondary  analysis  of  survey 
data.  Ithiel  de  Sola  Pool  shows  several  examples  in  his 
"Use  of  Available  Sample  Surveys  in  Comparative  Research," 
Encyclopedia  of  Sociology  (1963),  pp.  16-35.  An  interesting 
discussion  is  also  presented  by  Leslie  Roos  in  Intersurvey 
Comparison  -  A  Progress  Report,  mimeo  (Massachusetts  Insti¬ 
tute  of  Technology,  1967). 
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or  inconsistency  properties  of  subtables.  The  iteration 
must  converge  if  the  tables  used  in  the  iteration  are  con¬ 
sistent.  But,  how  can  we  determine  if  a  set  of  subtables 
is  consistent?  The  iteration  is  prog,  ammed  to  halt  when 
the  differences  between  the  appropriate  sums  and  the  sub¬ 
table  values  are  all  less  than  a  margin  of  error,  epsilon. 
Therefore,  we  could  allow  the  computer  to  make  many  itera¬ 
tions,  and,  upon  failure  to  halt,  we  could  conclude  that 
some  comparable  subtable  values  must  differ  by  more  than 
epsilon.  Actually,  we  first  use  another  algorithm  to  dis¬ 
cover  inconsistencies:  we  take  every  pair  of  subtables 
and  comp  ire  them  to  find  the  common  dimensions.  Then  we 
collapse  each  of  the  subtables  to  the  tables  corresponding 
to  the  common  dimensions  and  compare  these  tables  cell  by 
cell.  If  we  discover  differences  greater  than  epsilon,  we 
reject  one  of  the  tables. 

Now,  at  first  blush  it  would  seem  that  the  procedure 
of  making  every  possible  cell  by  cell  comparison  would 
eliminate  any  inconsistencies  in  the  data.  However,  it 
turns  out  that  this  is  not  the  case.  In  fact,  there  are 
certain  combinations  of  numbers  and  dimensionalities  of 
subtables  which  can  produce  inconsistencies  even  when  this 
cell  by  cell  comparison  has  been  made  for  every  pair  of  the 
subtables.  We  show  such  a  pathological  example  below. 

Here  the  faces  of  the  cube  represent  the  values  of  the 
three  two-dimensional  subtables.  If  we  compare  the 
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marginals  for  each  of  the  three  two-dimensional  subtables, 
we  will  find  that  the  marginals  are  equal;  i.e.,  there  are 
no  obvious  inconsistencies  in  these  tables.  However,  it  is 
easy  to  see  that  these  tables  cannot  be  produced  by  any  set 
of  eight  positive  interior  values. 


Fig.  II-2.  A  Pathological  Cube. 


This  is  a  rather  amazing  inconsistency.  It  means 
that  social  scientists  can  bring  sets  of  data  from  various 
sources  together  which  on  the  face,  applying  every  reason¬ 
able  test,  appear  to  be  consistent  and  yet  have  a  basic 
inconsistency  within  them.  It  seems  from  our  experiments 
with  these  data  that  this  situation  arises  whenever  the  sub- 
tables  are  such  that  they  completely  enclose  the  space  of 
the  grand  table.  For  example,  in  the  case  of  the  cube 
above,  the  three  two-dimensional  tables  completely  enclose 
the  space  of  the  cube.  Any  two  of  these  tables  taken  to¬ 
gether  do  not  enclose  the  space  of  the  cube  and,  in  fact, 
do  not  produce  any  inconsistency.  Likewise,  we  have  found 
that  four  three-dimensional  tables,  which  completely  enclose 
a  four  dimensional  space,  also  often  generate  these 
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inconsistencies.  At  one  point  in  the  programs  which  account 
for  the  duplication  data,  we  must  combine  several  three- 
dimensional  tables,  and  these  inconsistencies  have  arisen. 

At  this  point  we  throw  out  some  of  the  tables  until  we 
arrive  at  a  subset  of  the  subtables  which  are  consistent. 

The  decision  about  wnich  tables  are  thrown  out  is  made  on 
an  intuitive  basis  by  the  researcher  in  interaction  with 
the  computer,  as  we  have  not  yet  been  able  to  formulate  a 
way  to  decide  which  table  is  the  best  table  to  throw  out 
initially. 

Since  it  is  so  difficult  to  detect  these  inconsis¬ 
tent  tables,  we  have,  in  the  case  of  the  duplication  rou¬ 
tines,  introduced  a  second  attempt  to  find  these  inconsis¬ 
tent  tables.  It  may  be  obvious  to  the  reader  that  the 
values  in  the  grand  table  are  related  to  the  values  in  the 
subtables  oy  a  series  of  linear  equations.  For  example, 
in  the  case  of  the  cube  above,  we  can  relate  the  values  in 
the  interior  of  the  cube  to  the  values  in  the  faces  by  a 
set  of  seven  simultaneous  equations.  Since  these  equations 
contain  eight  unknowns,  there  is  no  unique  solution  to  the 
equations,  but  an  infinite  number  of  solutions  to  the  equa¬ 
tions,  given  that  the  faces  of  the  cube  are  consistent.  If 
the  faces  of  che  cube  are  inconsistent,  however,  there  is 
no  solution.  Therefore,  in  the  duplication  routines  we  use 
a  linear  programming  algorithm  to  find  a  feasible  solution 
to  the  equations  or  to  find  that  there  is  no  feasible 
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solution  to  the  equations.  If  the  linear  programming  code 
declares  that  there  is  no  feasible  solution  to  the  set  of 
equations,  we  know  that  the  tables  themselves  are  inconsis¬ 
tent. 

Thus,  the  requirement  that  the  tables  enclose  the 
m-dimensional  space  of  the  grand  table  in  order  for  incon¬ 
sistencies  to  occur  is  somehow  related  to  the  possibility 
of  having  no  feasible  solution  in  a  set  of  linear  equations. 
We  nav'e  not  explored  this  further,  but  we  would  suppose 
that  in  the  algebra  of  linear  equations  there  is  some 
theorem  about  the  possibility  of  no  feasible  solution  which 
is  also  interpretable  as  a  theorem  about  when  a  set  of 
equations  encloses  a  hyperspace. 

We  offer  one  more  observation  about  the  relationship 
between  the  iteration  technique,  solutions  of  linear  equa¬ 
tions,  and  maximum  entropy  extensions  of  probability  dis¬ 
tributions.  Brown's  proof  of  convergence  of  the  algorithm 
rests  on  the  fact  that  at  each  stage  of  the  iteration,  the 
entropy  of  the  resulting  distribution  must  be  larger  than 
the  entropy  at  the  preceding  stage.  Therefore,  one  possi¬ 
bility  for  a  test  for  consistency  of  the  tables  (or  for  a 
test  of  when  to  stop  iterating)  is  to  calculate  at  the  end 
of  each  iteration  cycle  -*-he  entropy  of  the  resulting  dis¬ 
tribution  and  see  if  it  is  in  fact  larger  than  at  the  end 
of  the  previous  cycle.  It  would  seem  that  at  a  point  where 
the  entropy  has  not  increased,  we  either  have  reached  the 
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optimal  solution  (or  at  any  rata  a  solution  which  cannot  be 
carried  any  further  because  of  the  problem  of  significant 
digits  and  rounding  error) ,  or  we  have  reached  the  point 
where  the  inconsistencies  in  the  tables  preclude  any  further 
approximation  to  the  final  distribution. 

Now  consider  the  relationship  to  the  set  of  linear 
equations.  The  iteration  appears  to  be  a  new  way  of  find¬ 
ing  a  feasible  solution  to  a  set  of  linear  equations  or  of 
finding  that  there  is  no  feasible  solution.  If  any  set  of 
l’rear  equations  can  be  thrown  into  this  form,  then  perhaps 
it  would  be  worthwhile  to  compare  the  speed  of  this  solution 
with  that  of  the  standard  Simplex  solution. 

We  have  digressed  for  a  moment  to  explain  in  detail 
one  of  the  most  fundamental  simulation  routines  and  a  gen¬ 
eral  estimation  process  which  seems  to  have  desirable  fea¬ 
tures  in  several  different  kinds  of  problems.  Now  we  turn 
to  the  actual  modeling  of  the  Cincinnati  mass  media  system 
of  1947-48. 


CHJkPTER  III 


CONSTRUCTING  T#B  MEDIA  SYSTEM  I: 

THE  SIMULATION  POPULATION  AND  THE  NEWSPAPER  AUDIENCES 

The  Role  of  a  Population  in 
Simulations 

Some  simulations — we  are  thinking  here  particular¬ 
ly  of  simulations  of  economic  systems — relate  dependent 
and  independent  variables  quite  directly  through  systems 
of  equations.  The  data  describing  the  independent  vari¬ 
ables  are  measured  or  assumed  by  the  researcher  and  the 
implied  values  of  the  dependent  variables  follow  rather 
directly  from  these  equations.  In  these  simulations, 
some  body  of  theory  rather  closely  relates  the  values  of 
the  independent  and  dependent  variables .  These  simula¬ 
tions  do  not'  seem  to  be  characterized  by  what  one  might 
call  sub-  or  micro-elements  or  a  population,  by  which  we 
mean  components  of  the  model  which  are,  in  some  theoreti¬ 
cal  scheme,  at  a  lower  level  of  analysis  than  other  ele¬ 
ments.  Thus,  sectors  of  an  economy,  or  individual  con¬ 
cerns,  or  a  number  of  (real  or  symbolic)  people  comprising 
a  simulation  population  may  function  at  a  lower  level  of 
analysis  than  that  at  which  one  analyzes  the  outputs  of 
the  simulation.  These  outputs  may  be  in  terms  of  some 
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net  or  aggregate  statistics  or  conditions  for  the  system, 
e.g.,  stability  or  instability,  total  income  or  G.N.P., 
or  exposure;  or  in  terms  of  comparisons  of  such  aggre¬ 
gates  among  population  subgroups,  e.g.,  income  and  G.N.P. 
of  sectors  of  the  economy,  or  exposures  of  males  or 
females;  the  outputs  are  usually  not  in  terms  of  the 
smallest  population  element.  This  is  true  for  the  mass 
media  simulation;  the  outputs  which  we  wish  to  analyze 
are  not  in  terms  of  individual  members  of  the  population 
but  rather  in  terms  of  the  various  subgroups. 

There  are  at  least  three  reasons  why  it  may  be 
necessary  to  define  elements  of  a  simulation  at  a  level 
so  removed  from  the  statistical  outputs  that  they  are 
never  used  for  analysis: 

1.  It  may  be  (as  in  the  present  simulation)  that 
the  input  data,  although  aggregated  over  sub¬ 
groups  or  the  entire  population,  come  in  vari¬ 
ables  too  diverse  to  be  dealt  with  at  the 
aggregate  level,  given  the  present  state  of 
the  theory.  The  theory  often  relates  to  a 
lower  level  and  therefore  the  data  must  be  dis¬ 
aggregated,  hung  (so  to  speak)  on  some  set  of 
population  points ,  and  later  recombined  or  in¬ 
tegrated.  Thus  in  the  present  simulations  we 
proceed  from  the  level  of  population  and  audience 
subgroups,  total  cumulations ,  and  duplication. 
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down  to  hypothetical  individuals  with  appropriately 
arranged  exposure  probabilities;  we  then  add. a 
driving  mechanism,  the  message  scenario,  to  get 
individual  exposures ,  and  then  reaggrege  te  to 
get  exposures  again  at  the  level  of  the  group 
or  the  total  population.  The  individual  points 
do  not  perform  any  action  or  interact  with  each 
other  at  all.  We  might  characterize  these  as 
disaggregative-integrative  simulations,  or  con¬ 
sistency  machines. 

2.  It  may  happen  that  the  macro-level  theory  is 
well  developed,  but  that  the  data  are  available 
only  at  a  lower  level,  therefore  making  it  neces¬ 
sary  to  model  at  the  lower  level  and  then  aggre¬ 
gate.  This  would  also  allow  exploration  of  the 
effects  of  changes  at  the  lower  level  (e.g.  in  1 

a  particular  market)  on  the  aggregate  system. 

3.  The  researcher  may- be  interested  in  complex  pro’h- 
cesses  at  a  lower  level,  (partially)  validating 
them  by  the  resulting  aggregate  output.  This  is 
often  the  case  when  the  population  er  sub¬ 
elements  are  in  interaction  or  "coupled"  with 
each  other.  Examples  of  this  kind  of  simulation 
include  the  various  models  of  small  groups  and 
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the  Abelson  and  Bernstein  fluoridation  simulation.1 

In  the  first  two  cases  above,  where  the  population 
is  uncoupled,  the  researcher  should  consider  whether  he 
might  not  be  able  to  describe  the  relevant  population 
subgroups  using  mathematical  functions,  rather  than  with 
individual  population  points.  In  the  present  simulation 
this  would  seem  to  be  a  just  feasible  task;  for  coupled 
populations  uie  mathematics  are  probably  too  complex. 
Nevertheless,  as  researchers  become  more  familiar  with 
modeling  and  more  sophisticated  mathematically,  simula¬ 
tions  such  as  this  will  probably  be  more  completely  des¬ 
cribed  in  terms  of  mathematical  functions,  with  a  re¬ 
sulting  great  economy  of  running  time,  increase  flexi¬ 
bility,  and  ease  and  clarity  of  analysis. 

In  a  simulation  using  an  uncoupled  symbolic 
population,  it  seems  unnecessary  that  the  distribution 
of  the  population  points  match  either  exactly  or  pro¬ 
portionately  the  distribution  of  the  modeled  population. 

In  a  coupled  population  where  the  data  points  interact. 


1Abelson  makes  the  important  distinction  between 
coupled  and  uncoupled  populations  in  his  chapter  on  simu¬ 
lation  for  the  revised  Handbook  of  Social  Psychology, 
Lindzey,  Gardner  and  Aronson,  Eliot  (editors) .  In  press , 
Addison -Wesley.  A  description  of  the  fluoridation  simu¬ 
lation  is  found  in  Robert  P.  Abelson  and  Alex  Bernstein, 
"A  Computer  Simulation  Model  of  Community  Referendum 
Controversies,"  Public  Opinion  Quarterly,  XXVII,  (1963) 
93-122. 
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there  may  be  for  some  processes  a  "critical  mass"  which 
quite  changes  the  behavior  of  the  system  and  therefore 
necessitates  a  scale  model  {at  least)  of  the  real  popula¬ 
tion.  However,  for  a  stochastic  simulation  like  the  pre¬ 
sent  one,  it  may  be  much  more  efficient  to  use  a  "strati¬ 
fied  sample" — even  equal  numbers  in  each  population  subgroup 
or  cell — if  we  wish  to  do  a  detailed  analysis  of  some 
important,  but  proportionately,  small,  subgroup.  The 
computer  is  just  the  tool  to  carry  out  complex  weightings 
of  population  subgroups  in  calculating  aggregate  statis¬ 
tics.  Of  course,  if  each  population  subgroup  is  described 
by  some  mathematical  function  rather  than  actually  pro¬ 
duced  as  individual  points,  then  these  functions  carry 
the  subgroup  weightings. 

With  these  preliminary  remarks,  we  turn  now  to 
the  actual  construction  of  the  Cincinnati  mass  media 
simulation  population. 

The  Cincinnati  Simulation  Population 

The  simulation  population  consists  of  a  certain 
number  of  hypothetical  persons  distributed  throughout  the 
space  defined  by  the  attributes  or  variables  which  have 
been  selected  as  relevant  to  the  process  being  modeled. 

The  points  in  this  property  space  (we  will  usually  call 
them  "cells"  or  "population  types")  represent  all  possible 
combinations  of  the  levels  or  categories  of  all  attributes. 
Thus,  a  typical  population  type  might  consist  of  all 


97 


young,  high  school  educated,  middle  status,  males  who  are 

i 

\  interested  m  international  affairs. 

r 

\ 

I  Since  the  present  simulation  model  was  not  pro¬ 

grammed  to  allow  weighting  of  cells  or  audience  types,  it 
would  seem,  for  the  Cincinnati  simulation,  that  we  would 
like  to  replicate,  in  miniature,  the  distribution  across 
cells  of  the  1947-48  Cincinnati  population.  However,  the 
choice  of  a  population  base  for  the  simulation  is  really 
not  this  straightforward.  Actually  the  target  populations 
surveyed  by  NORC  and  the  Census ,  and  by  several  of  the 
other  surveys  to  be  presented  below,  were  not  identical. 
During  the  1940  census,  a  certain  urban  area  around  the 
city  formed  what  was  known  as  the  Cincinnati  Metropolitan 
District.  This  was  the  basis  also  of  the  1947-48  NORC 
surveys.  However,  for  the  1950  census,  which  probably 
is  a  better  representation  of  the  1947-48  Cincinnati  popu¬ 
lation,  the  boundaries  of  this  area  were  slightly  changed 
and  the  new  unit  was  called  the  Cincinnati  Metropolitan 
Area.  Also,  the  base  for  reporting  newspaper  ci  .ation 
figures  and  for  several  Times-Star  audience  surveys  was  a 
slightly  different  area  called  the  Cincinnati  City  Zone. 
Finally,  the  Hooper  ratings  used  in  calculating  the  radio 
audiences  are  produced  from  a  sample  of  the  telephone 
homes  located  in  the  ncu-koll-cali  area  of  the  city. 

In  order  to  simplify  matters,  we  shall  ignore 
these  small  differences  in  the  target  populations  for 


these  surveys ,  and  turn  to  the  obvious  larger  differences, 
the  errors  in  the  NORC  sample,  and  (in  Chapter  IV)  the 
Hooper  sample  for  radio  audiences. 

The  NORC  sample,  even  after  weighting,  was  not 
an  accurate  representation  of  the  Cincinnati  population. 
Tables  III-2  and  III-3  show  the  breakdown  according  to  the 
September  1947  NORC  sample  and  according  to  1950  census 
figures  for  the  Cincinnati  Metropolitan  Area.  It  can  be 
seen  that  in  several  ways  the  sample  and  the  population 
differ.  People  with  only  grade  school  education  are  sig¬ 
nificantly  underrepresented  in  the  sample  and  younger  peo¬ 
ple  are  also  slightly  underrepresented. 


Table  III-l.  Distributions  of  the  NORC  sampJe  by  sex. 
Age,  and  Education. 


Male 

Age 

Education  21-39  40- 


Marginal 

Totals 


Female 

Age 

21-39  40- 


Marginal 

Totals 


College 


6.35%  12.07 


2.18% 


G.S. 

Marginal 


4.99 

10.80 

15.79 

4.81 

15.51 

20.32 

22.41 

24.32 

46.75 

25.15 

28.12 

53.25 

<N=515> 

(N=587) 

(N=1120) 


The  entries  are  percentages  of  the  weighted  sample 
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Table  III-2.  Distribution  of  the  Adult  Population  of  the 

Cincinnati  Metropolitan  Area  by  Sex,  Age,  ar-1 
Education  According  to  the  1350  Census. 


Education 

Male 

Female 

Age 

Age 

Marrri  nal 

21-39 

40- 

Totals 

21-39 

40- 

Totals 

College 

4 . 17%a 

3.03%  7.21% 

3.16% 

2.55% 

5.71% 

H.S. 

9.73 

6.89 

16.63 

13.25 

8.80 

22.05 

G.S. 

6.33 

16.85 

23.18 

6.50 

18.70 

25.20 

Marginal 

Totals 

20.23 

26.77 

47.02 

22.91 

30.05 

52.96 

aThe  entries  are 

percentages  of  the 

total  population 

(N=617,548) . 

Table  III- 

3 .  Percentage 

of  the  Ma -ch,  1948 

NORC  Sample  Re- 

porting  Exposure  to  at  Least  Three  M.edia,  by 

Education, 

Age  end  Sex. 

Percentage  Exposed  to 

Dimension 

At  Least  Three 

Media 

Education 


College  68 

High  School  43 

Grammar  School  17 

Age 

21-33 
40- 

Sex 

Male 
Female 


44 

32 


These  differences  are  troublesome  for  the  following 
reasons.  On  the  one  hand,  the  surveys,  circulating  figures, 
etc.,  by  which  the  audiences  and  eventual  exposures  are 
calculated,  are  presumably  valid  for  the  Cincinnati  popu¬ 
lation  as  described  by  the  1950  census.  Cn  the  other  hand, 
after  processing  all  the  data  through  the  simulation,  we 
would  like  to  correlate  the  theme  exposures  of  the  simu¬ 
lated  population  with  the  changes  in  information  and 
opinions  which  were  measured  ny  th  .  NOLC  survey.  Thus,  if 
we  model  the  population  as  recorded  by  the  census,  the 
circulation  and  audience  statistics  are  presumably  directly 
valid  as  input,  but  the  output  must  be  interpreted  for  pur¬ 
poses  of  comparison  with  the  panel  data  as  rates  of  exposure 
(or  percentage  exposed)  for  each  subgroup.  Conversely,  if  the 
survey  panel  is  modeled  in  the  simulation,  then  the  input 
data  must  be  modified  so  as  to  be  valid  for  this  different 
population  configuration.  This  is  the  course  which  has 
been  followed  here.  Since  much  of  the  data  used  to  estab¬ 
lish  newspaper  audience  ratings  for  the  population  types  is 
provided  by  the  NORC  survey  itself,  and  since  audience  data 
for  the  radio  is  known  only  by  sex — a  breakdown  correctly 
reproduced  by  the  NORC  weighted  panel — we  have  used  the 
weighted  NORC  data  for  construction  of  the  simulation  popu¬ 
lation.  Essentially,  the  audience  input  data  is  so  poor 
that  it  hardly  justifies  constructing  a  simulation  popula¬ 
tion  exactly  modeling  that  of  the  Cincinnati  Metropolitan  Area. 


The  dimensions  chosen  to  describe  the  simile. tion 
population  were  sex,  age,  education,  socioeconomic  status, 
and  initial  (September)  level  of  interest  in  inter national 
affairs.  All  of  these  dimensions  were  recorded  in  the  NORC 
survey.  In  an  entirely  urban  population,  these  dimensions 
seemed  the  most  critical  for  an  explanation  of  exposure  to 
international  news.  For  example,  the  report  on  the  Cin¬ 
cinnati  Plan  found  that 

Among  those  rated  as  'interested*  in  September  by 
virtue  of  their  having  expressed  'keen'  interest 
in  two  or  more  of  four  given  international  topics, 

47  percent  reported  exposure  to  three  or  more  of 
the  media  by  March;  whereas,  among  those  relatively 
'uninterested,'  29  percent  did. 2 

The  March  survey  also  showed  the  effect  of  education,  age, 

and  sex  on  media  exposure  (Table  III-3) . 

Table  III-4  indicates  the  levels  and  rarginals  for 
each  of  these  dimensions.  This  combination  of  dimensions 
and  levels  divides  the  population  into  144  populatior  types. 

The  NORC  Septt  ruber  survey  before  weighting  was  a 
sample  of  size  745.  In  weighting  to  match  the  population, 
many  respondents’  cards  were  duplicated  so  that  the  result¬ 
ing  weighted  sample  size  was  11ZG.  dividing  this  number  by  s 

the  144  audience  types  gives  an  a rerage  of  only  7.8  weighted 
panel  members  per  audience  type.  With  cells  both  well  above  ! 


2 

The  quotation  and  table  are  taken  from  Shirley  A. 
Star  and  Helen  MacGill  Hughes,  Report  on  the  Cincinnati 


Plan,  p.  9 


Table  III-4.  Marginal  Percentage  Breakdowns  from  the  NORC 
Survey  for  the  Dimensions  of  the  Simulation 
Population . 


Dimension 


Percenreae 

Marginal 


Male 

Female 


46.8 

53.2 


21-3L 

40- 


47.7 

52.4 


Education 


College 
High  School 
Grade  School 


18.1 

45.7 

36.1 


S. E.S. 


High 

Middle 

Low 


19.2 

62.2 
18.7 


September  International 
Affairs  Interest 


Highest 
Upper  Middle 
Lower  Middle 
Lowest 


25.9 

22.0 

28.1 

24.0 


and  below  this  average,  we  obviously  are  estimating  the  size 
of  many  cells  from  a  very  small  sample.  Thus,  this  is  just 
the  situation  to  use  the  Mosteller  parameter  estimation 
technique.  This  we  have  done,  constructing  the  simulation 
population  from  the  10  possible  three-dimensional  subtables. 

Finally,  in  order  to  reduce  the  variance  due  to  the 
stochastic  procedures  of  the  simulation  (e.g.,  the  random 
assignment  of  probabilities  within  cells)  we  have  chosen 
the  size  of  the  simulation  population  to  be  2,000  computer 
individuals  distributed  across  the  144  audience  types. 

The  Ye • ' cle  Audience  Distributions 

Having  constructed  the  simulation  population,  we 
now  f i.’mate  the  breakdown  across  the  population  types  of 
the  average  audience  of  each  of  the  vehicles.  For  each  of 
these  types,  the  number  in  the  vehicle  audience  divided  by 
the  total  number  of  that  type  provides  a  mean  probability 
of  exposure  or  audience  rating  for  the  vehicle  and  audience 
type.  First  we  turn  to  the  newspapers. 

The  Cincinnati  Newspaper  Audiences 

There  were  in  Cincinnati  at  the  time  of  the  NO..C 
Survey  three  uaily  newspapers,  the  Enquirer ,  the  Times-Star , 
and  the  Post.  The  Enquirer  was  a  morning  newspaper;  the 
Times-Star  and  '-.he  Post  were  evening  newspapers.  In  addi¬ 
tion  to  the  Monday  through  Saturday  issues  >ch  of  these 


newspapers,  the  Enquirer  published  a  F  uiday  newspaper  which 


we  have  included  as  a  fourth  newspaper  vehicle.  Thus  we 
wish  to  estimate  the  distribi  tion  of  the  average  audience 
of  each  of  these  four  vehicles  over  the  144  possible  audi¬ 
ence  types. 

The  NORC  survey  asked  about  each  respondent's  daily 
reading  habits  and  found  the  fundamental  data  from  which 
each  audience  distribution  has  been  estimated.  The  ques¬ 
tion  asked  of  each  respondent  was  "Do  you  usually  read  a 
daily  newspaper?";  if  "yes,"  "Which?".  The  readership 
found  for  each  of  the  daily  newspapers  was:  for  the  En¬ 
quirer  ,  43  percent;  for  the  Post,  52  percent;  and  for  the 
Times-Star ,  55  percent.  Four  percent  of  the  population  re¬ 
ported  reading  a.  y  other  newspaper,  and  thr-  e  percent  read 
no  daily  newspaper.  Because  of  the  wording  of  the  question 
these  audience  percentages  are  almost  surely  slightly  high. 
Generally,  the  most  accurate  method  of  measuring  the  aver¬ 
age  occurrence  of  a  frequently  recurring  event  is  to  identi- 

3 

fy  a  fixed  period  of  time  for  the  measurement.  In  media 
research  the  tactic  is  to  ask  about  the  exposure  to  the  last 
issue  of  the  vehicle;  for  daily  newspapers  this  means  asking 
about  exposure  to  yesterday's  (or  Saturday's)  issue.  Thus 

3 

If  there  is  a  significantly  large  proportion  of 
the  population  which  is  exposed  to  about  half  of  the  issues 
and  whose  answer  to  the  question  as  phrased  below  is  likely 
"yes,"  then  the  resulting  percentage  will  significantly 
overestimate  the  average  audience. 


1»*  '• 
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the  standard  newspaper  readership  question  is  of  the  form 

4 

"Which  daily  newspaper  or  newspapers  did  you  read  yesterday?" 

Not  only  are  the  NORC  readership  figures  for  each 
newspaper  probably  too  high,  but  the  form  of  the  question 
probably  also  causes  an  underestimation  of  multiple  news¬ 
paper  readership.  However  the  disproportionate  sampling  of 
the  higher  educated,  higher  status  members  of  the  popula¬ 
tion,  i.e.,  of  those  who  are  more  likely  to  read  more  than 
one  newspaper,  may  compensate  to  some  degree  for  the  bias 
introduced  by  the  wording  of  the  question. 

Because  of  the  problems  of  the  NORC  data,  the  small 
sample  size,  the  method  cf  drawing  the  sample,  and  the 
phrasing  of  the  question,  it  would  be  helpful  to  have  addi¬ 
tional  evidence  about  the  newspaper  exposure.  This  evidence 
is  available  from  two  audience  surveys  conducted  by  the  Cin¬ 
cinnati  Times-Star,  one  in  1947  and  the  second  in  1951. 

These  were  essentially  q’.uta  samples  of  housewives  living 
in  the  Cincinnati  City  Zone.^  The  question  asked  was 

4 

In  the  aided  recall  method,  copies  of  each  of  the 
previous  day's  local  newspapers  are  shown  to  the  respondent. 
This  aid  usually  produces  a  somewhat  higher  readership. 

5 These  surveys  are  described  in  detail  in  The  Flow 
of  Retail  Buying  Traffic  in  Cincinnati,  1957:  Fifth  Survey, 
The  Cincinnati  Post,  (Cincinnati,  Ohio;  1847)  and  Buying 
Habits  Survey;  The  Cincinnati  Market  (1951),  the  Cincinnati 
Post  (Cincinnati,  Ohio;  March  1952) .  The  samples  were 
quota  samples  of  all  houserives  in  the  Cincinnati  City  Zone, 
controlling  for  economic  groups,  geographic  location,  race, 
and  types  of  residence.  The  sample  size  for  the  1947  sur¬ 
vey  was  2500  and  for  the  1951  survey  was  4588. 
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"Which  daily  newspaper  or  newspapers  were  read  in  your  home 
yesterday?"  This  phrasing  of  the  question  probably  also 
biases  the  readership  upward  insofar  as  every  adult  member 
of  the  family  was  not  a  reader  of  the  paper.  The  data  for 
the  three  surveys  are  presented  in  Tables  III-5  and  III-6 
below. 


Table  III-5.  Total  Readership  of  Each  Daily  Newspaper 
According  to  Three  Surveys 


Survey 

Total  Readership 

Enquirer 

Times-Star 

Post 

Times-Star,  1947 

37. 44%a 

54.84% 

51.12% 

NORC,  September,  1947 

43.50 

55.30 

52.40 

Times-Star,  1951 

42.70 

48.85 

48.43 

aData  represent  percentages  of  the  adult  population. 


In  examining  these  tables,  we  probably  should  dis¬ 
count  somewhat  the  data  from  the  Times-Star  survey  of  1951 
since  we  might  reasonably  expect  some  change  in  readership 
habits  over  the  four-year  interval  from  1947  to  1951.  For 
the  two  1947  surveys  we  find  reasonably  close  agreement 
except  for  two  cases,  first  for  the  readership  of  all  three 
newspapers  for  which  the  NORC  survey  finds  approximately 
twice  the  readership  (14.12%)  of  the  Times-Star  survey  and 
second,  for  the  average  audience  of  the  Enquirer,  which  the 


Data  represent  percentages  of  the  adult  population. 
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Times-Star  survey  finds  to  be  about  37.4  per  cent,  and  which 
the  NORC  survey  finds  to  be  43.5  per  cent.  The  first  dis¬ 
crepancy,  the  finding  that  the  triple  readership  is  larger 
in  the  N^RC  survey  than  in  t.ie  Times-Star  survey,  comes 
somewhat  as  a  surprise,  since  the  question  phrasing  for 
the  NORC  survey  would  seem  to  bias  the  triple  readership 
downward.  Perhaps  the  discrepancy  occurs  because  of  the 
biases  in  the  NORC  sample  although  it  is  difficult  to 
imagine  that  the  NORC  probability  sample,  even  though  it 
is  a  smaller  sample  than  the  Times-Star  sample,  was  less 
accurate  a  representation  of  the  population  than  the  Times- 
Star  quota  sample. 

To  summarize  our  feelings  about  the  data  at  this 
point,  we  would  say  that  the  average  audiences  are  prob¬ 
ably  somewhat  too  high  because  of  the  phrasing  of  the  sur¬ 
vey  questions  and  this  holds  for  all  three  surveys.  For 
the  NORC  survey  it  seems  reasonable  to  suspect  that  the 
measurement  of  the  triple  readership  is  significantly  too 
high. 

We  do  have  the  results  of  one  other  survey  which 
somewhat  corroborates  the  two  findings  in  the  paragraph 
above.  This  study,  A  National  Study  of  Newspaper  Reading: 
the  Functions  of  Newspapers  for  Their  Readers,  was  con¬ 
ducted  during  March  and  April,  1961,  by  the  Audits  and 
Surveys  Company  from  a  national  sample  of  newspaper 
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readers.6  In  this  study  the  combination  of  two  samples 
upon  which  most  of  the  data  is  based  totaled  2,449  house¬ 
holds,  with  4,826  individual  respondents.  From  a  natioKial 
survey  we  are  not  able  to  ascertain  the  readership  of  in¬ 
dividual  newspapers,  such  as  the  Cincinnati  newspi p»rs; 
however,  we  do  have  data  on  the  number  of  newspapers  read 
by  various  percentages  of  the  population.  (Of  course, 
when  we  look  at  this  data,  we  must  remember  that  it  is 
national  data  taken  during  the  year  1961,  that  is,  four¬ 
teen  years  after  the  time  of  the  other  surveys.)  Ta±>le 
III-7  below  shows  readership  by  number  of  newspapers  for 
each  of  the  four  surveys. 


Tahle  III-7.  Average  Multiple  Newspaper  Readership  in 
Four  Surveys 


Number  of 
Newspapers 

Survey 

Times-Star 

NORC 

Times-Star 

National  Study 

Read 

- I5T7 - 

1947 

- I35T - 

1961 

0 

2.52%a 

3.30% 

3.07% 

20.3% 

1 

58.76 

56.03 

60.31 

53.1 

2 

31.72 

26.57 

30.19 

22.5 

3  or  more 

7.00 

14.10 

6.43 

4.1 

Total 

100.00% 

(N=2500) 

100.00% 

(N=745) 

100.00% 

(N=4588) 

100.00% 

(N=4368) 

aData  are  percentages  of  the  population. 


c 

"Audits  and  Surveys  Company ,  Inc . ,  A  National  Study 
of  Newspaper  Reading:  the  Functions  of  Newspapers  for  Their 
Readers  (New  York;  Audits  and  Surveys  Company,  Inc.,  1961). 


The  1961  national  study  found  generally  smaller 
multiple  readership  and  generally  higher  non-readership  of 
the  average  daily  newspaper.  Probably  most  of  the  differ¬ 
ence  between  the  1961  national  study  and  the  other  studies 
is  due  to  changes  in  readership  during  the  intervening 
fourteen  years.  By  1961  there  were  far  fewer  three- 
newspaper  cities  and,  in  addition,  it  is  well  Known  that 
newspaper  readership  has  not  kept  pace  with  the  growth  of 
the  population  in  the  last  twenty  years.  However,  the 
1961  study  does  add  another  piece  of  evidence  which  seems 
to  indicate  the  possibility  of  an  overestimation  of  reader- 
ship  on  the  part  of  the  first  three  surveys. 

There  exists  one  more  piece  of  evidence  which  seems 
to  indicate  that  the  estimates  of  readership  according  to 
the  1947  NORC  survey  are  somewhat  high.  From  the  1950  cen¬ 
sus  we  find  the  number  of  adults  living  in  a  household  in 
Cincinnati  was,  on  the  average,  about  2.15  adults  per 
household.  (There  are  two  ways  to  calculate  the  number  of 
adults  per  household,  depending  essentially  on  the  defini¬ 
tion  of  a  household.  The  two  calculations  give  a  minimum 
number  of  adults  per  household  as  2.12  adults  per  household 
and  a  maximum  number  of  2.23  adults  per  household.)  The 
1961  readership  study  found  2.04  adult  readers  per  news¬ 
paper  copy.  This  figure  seems  quite  suggestive  if  there 
were,  nationally,  about  2.15  adults  per  household.  One 
might  conceive  of  almost  every  copy  of  a  newspaper  sold 
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going  into  a  household  and,  in  general,  the  average  adult 
in  the  household  reading  the  newspaper.  From  the  city 
zone  average  circulation  of  Cincinnati  newspapers  for  the 
year  ending  March  31,  1948,  and  the  NORC  audience  esti¬ 
mates,  we  have  calculated  the  number  of  adult  readers  per 
copy  of  the  newspaper.  For  the  three  newspapers,  these 
figures  come  out  remarkably  close  together.  For  the  Post 
and  the  daily  Enquirer,  the  estimate  gives  2  ..dult 
readers  per  copy;  for  the  Times-Star ,  the  figure  is  2.58 
adult  readers  per  copy  of  the  newspaper.  Although  these 
figures  are  very  close,  they  exceed  substantially  the  fig¬ 
ure  of  2.04  adult  readers  per  copy  found  in  the  1961  survey 
Moreover,  they  also  exceed  the  estimate  of  2.15  adults  per 
household  in  Cincinnati  in  1950 .  Thus  this  bit  of  evidence 
also  indicates  the  possibility  of  the  NORC  survey's  over¬ 
estimation  of  the  average  newspaper  readership.  (Of  course 
we  must  realize  that  the  adult  readers  projected  from  the 
NORC  survey  are  not  the  ordinary  adults  from  the  Cincinnati 
population  since  the  NORC  survey  is  not  a  faithful  reflec¬ 
tion  of  the  Cincinnati  population.  The  NORC  adults  are 
more  likely  to  read  newspapers ,  and  therefore  we  might  ex¬ 
pect  the  number  of  adult  readers  per  copy  to  be  higher 
among  this  group  than  among  the  population  of  Cincinnati 
as  a  whole. ) 

To  sum  up,  it  seems  hard  to  believe  that  on  the 


average  day  only  3.3  per  cent  of  the  NORC  population  fails 
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to  read  at  least  one  newspaper.  Therefore,  we  have  some¬ 
what  arbitrarily  decided  that  this  figure  should  be  in¬ 
creased  to  ten  per  cent  of  the  adult  population.'7 

The  correction  is  not  as  simple  as  it  would  first 
appear.  The  procedure  finally  decided  upon  is  as  follows? 
we  assume  that  everyone  who  answered  ’’no"  to  the  question 
by  readership  of  a  newspaper  did  indeed  not  read  the  news¬ 
paper.  However,  we  shall  assume  that  a  constant  proportion 
of  those  people  who  did  answer  "yes"  to  the  question  of 
newspaper  readership  were  not  in  fact  in  the  average  audi¬ 
ence  of  the  newspaper.  This  constant  proportion  then  rep¬ 
resents  a  probability  of  not  being  in  the  newspaper's 
average  audience  given  that  the  respondent  answered  "yes" 
to  the  question  about  newspaper  readership.  Then,  for 
example,  the  number  of  people  in  the  group  who  claimed  to 
read  three  newspapers,  but  who  in  fact  on  an  average  day 
read  no  newspapers,  is  the  number  claiming  multiplied  by 
the  cube  of  the  probability.  Likewise,  the  number  who 
claimed  to  read  two  newspapers  but  in  fact  read  no  news¬ 
papers  on  an  average  day  is  the  claiming  number  multiplied 
by  the  square  of  the  probability.  If  we  let  represent 

the  percentage  claiming  to  have  read  i  newspapers  (i  =  0, 

1,  2,  or  3),  we  arrive  at  the  following  cubic  equation 
which  must  be  solved  for  q,  the  probability  of  not  reading, 
given  a  "yes"  answer: 

7 

The  1961  national  survey  also  found  in  cities  of 
500,000  or  more  than  16.1  per  cent  of  the  adults  were  non- 
readers  of  a  newspaper  on  an  average  day. 


or 


10%  =  K3q  +  K2q  +  I^q  f  KQ 

10  =  14 . 13q3  +  26 . 60q2  +  56.06q  +  3.21  . 

An  approximate  solution  to  this  equation  is  a  value  for  q 
equal  to  0.1145.  Therefore,  the  probability  that  a  person 
did  read  a  newspaper  on  an  average  day,  given  that  he 
claimed  readership,  is  0.8855.  This  is  the  figure  which 
gives  an  average  audience,  exposed  to  at  least  one  news¬ 
paper,  of  90  per  cent  of  the  population.  Therefore,  in 
constructing  the  audience  tables  for  the  simulation,  we 
have  used  the  audience  tables  given  by  the  NORC  survey, 
however,  multiplying  every  figure  given  by  the  NORC  sur¬ 
vey  for  readership  by  the  factor  0.8855. 

Just  as  in  the  case  of  the  population,  the  tables 
actually  entered  into  the  computer  from  the  NORC  survey 
were  the  ten  three-dimensional  tables  which  can  be  con¬ 
structed  from  the  five  dimensions  of  the  population.  The 
rationale  for  this  was  exactly  that  of  the  population; 
namely,  that  the  sampling  errors  from  such  a  small  sample 
would  not  justify  entering  in  the  full  five-dimensional 
table.  Therefore,  the  iteration  technique  was  used  on  the 
audience  as  well  as  on  the  population.  After  the  tables 
were  entered  into  the  computer,  the  correction  factor  spoken 
about  in  the  paragraph  above  was  entered  into  the  computer 
to  multiply  each  of  these  tables  before  the  iteration  was 
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performed.  The  final  average  audiences  for  each  of  the 
three  daily  newspapers  resulting  from  the  correction  are 
as  follows: 


Average  Audiences  as  Percentages 
of  the  Total  Population 

Daily  Enquirer  Times-Star  Post 
39.2%  50.2%  47.6% 

The  NORC  survey  asked  only  about  the  readership 
of  the  daily  newspapers;  there  was  no  readership  data  for 
the  Sunday  Enquirer.  Likewise,  the  Times-Star  surveys  did 
not  ask  about  readership  of  the  Sunday  Enquirer.  There¬ 
to.,  e,  we  have  had  to  make  educated  guesses  about  the  dis¬ 
tribution  and  the  size  of  the  audience  of  the  Sunday  En¬ 
quirer.  As  a  base  line  for  estimating  the  average  audience, 
we  take  the  average  circulation  in  the  city  zone  through 
the  year  ending  March  31,  1948.  This  circulation  was 
180,150  or  about  one  and  one-half  times  that  of  any  of  the 
daily  newspapers.  The  1961  national  survey  found  that  the 
number  of  readers  per  copy  of  the  Sunday  newspapers  was 
slightly  lower  than  that  of  the  weekday  papers.  For  the 
Sunday  newspapers,  the  study  found  1.92  adult  readers  per 
copy,  compared  with  2.04  for  the  weekday  newspapers.  There¬ 
fore,  we  have  three  possible  estimates  of  the  readership  of 
the  Enquirer;  the  lowest  estimate  is  derived  from  the  value 
of  1.92  adult  readers  per  copy,  the  highest  estimate  from 
the  2.59  adult  readers  per  copy,  estimated  from  the  NORC 


data  for  this  daily  newspapers.  We  have  chosen  a  middle 
range  estimate,  that  estimate  which  is  produced  by  using 
the  number  of  adults  per  household  as  the  number  of  adult 
readers  per  copy.  This  figure,  2.23  adults  per  copy,  pro¬ 
duced  an  average  readership  for  the  Sunday  Enquirer  of 
about  65  percent  of  the  population. 

Having  estimated  the  average  audience,  however,  we 
still  must  estimate  the  distribution  of  the  audience 
throughout  the  population.  What  we  have  done  is  to  assume 
that  the  audience  is  distributed  as  is  the  audience  of  the 
daily  Enquirer ,  making,  however,  some  interesting  changes 
in  the  iteration  technique  once  we  have  made  this  assump¬ 
tion.  The  ten  three-dimensional  tables  that  describe  the 
audience  of  the  daily  Enquirer  were  entered  as  tables  for 
the  audience  of  the  Sunday  Enquirer .  However,  the  figures 
in  these  tables  were  each  multiplied  by  the  ratio  of  the 
average  audience  of  the  Sunday  Enquirer  to  the  average 
audience  of  the  daily  Enquirer  in  order  to  increase  the 
figures  to  produce  the  65  per  cent  average  audience  for 
the  Sunday  Enquirer.  Of  course,  if  some  cells  of  the 
population  are  nearJy  at  the  saturation  level  for  reader- 
ship  of  the  daily  Enquirer,  this  increase  may  very  well 
make  the  audience  of  these  cells  larger  than  the  population 
of  the  cell.  We  must  in  some  way  account  for  the  distri¬ 
bution  of  the  population  as  we  iterate  upon  the  audience 
figures  for  the  daily  Enquirer;  therefore  we  have 
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initialized  the  values  of  the  iteration  technique  with  the 
population  values  as  derived  in  the  previous  iteration. 

Thus  the  daily  Enquirer  inflated  audience  values  are  biased 
by  the  population  cell  values  to  produce  the  cell  values 
for  the  Sunday  Enquirer  audience.  This  method  is  not  fool¬ 
proof,  and  it  still  is  possible  for  some  cells  to  have 
audiences  greater  them  the  population  for  these  cells.  Of 
the  144  cells  in  the  population,  eighteen  cells  were  some¬ 
what  larger  in  audience  than  in  population.  These  cells 
were  then  shown  to  the  researcher  by  the  simulation  and 
the  researcher  was  asked  to  pick  appropriate  probabilities 
of  exposure,  i.e.,  average  audiences  for  the  inconsistent 
cells.  After  correcting  the  18  improper  cells  by  setting 
their  cell  means  at  a  value  of  .95,  the  simulated  audience 
for  the  Sunday  Enquirer  was  64.3  per  cent  of  the  popula¬ 
tion,  remarkably  close  to  the  65  per  cent  estimated  from 
the  circulation. 

Thus,  we  have  produced  the  cell-by-cell  mean  audi¬ 
ences  for  the  four  Cincinnati  newspapers  in  the  Cincinnati 
mass  media  system.  The  resulting  average  audiences  as 
proportions  of  the  population  for  the  four  Cincinnati  news¬ 
papers  are  shown  below. 

Final  Average  Audiences  for  the  Four 
Cincinnati  Newspapers  As  Percentages 
of  the  Total  Population 

Daily  Enquirer  Times-Star  Post  Sunday  Enquirer 


39.2% 


50.2% 


47.6% 


64.3% 


CHAPTER  IV 


CONSTRUCTING  THE  MEDIA  SYSTEM  II: 

THE  RADIO  AUDIENCES 

The  second  important  source  of  United  Nations  and 
international  news  during  the  six-month  period  was  radio. 

In  the  NORC  panel  at  the  end  of  the  period,  53  percent  re¬ 
ported  hearing  radio  news  programs  about  the  United  Nations 
and  26  percent  had  heard  short  radio  mentions  of  the  United 
Nations  between  programs.1  Unfortunately,  no  relevant  rec¬ 
ord  of  radio  news  messages  during  the  period  has  been  found 
(probably  nothing  exists  comparable  to  the  newspapers  them¬ 
selves  which,  of  course,  are  available  in  library  micro¬ 
films).  Therefore,  we  have  had  to  rely  on  educated  guess¬ 
work  plus  some  press  comment  and  the  memories  of  those  who 
participated  in  the  campaign  twenty  years  ago. 

At  the  time  of  the  study,  five  AM  stations,  two  FM 
stations,  and  one  television  station  were  broadcasting  in 
Cincinnati.  For  our  purposes,  the  effect  of  the  two  FM 
stations  and  the  television  station  were  negligible.  Ac¬ 
cording  to  the  City  Hooperatings ,  the  largest  combined 


1The  complete  breakdown  by  information  source  is 
found  in  NORC,  Cincinnati  Looks  Again,  p.  27. 
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rating  of  all  radio  stations  other  than  the  five  AM  stations 

in  Cincinnati  was  1.2,  i.e.  1.2  percent  of  Cincinnati  radio 

homes  at  most  were  ever  listening  to  radio  stations  other 

2 

than  the  five  AM  stations.  The  mean  rating  was  approxi¬ 
mately  0.5. 

Likewise,  the  television  audience  was  very  small 
at  this  time,  not  because  of  lack  of  interest,  but  simply 
because  there  were  so  few  television  sets  in  service.  Ac¬ 
cording  to  Variety ,  in  the  fall  of  1948  (nine  months  to  a 
year  after  the  period  of  the  study)  there  were  only  about 
4000  television  homes  in  Cincinnati.-3 

Therefore,  we  shall  restrict  our  attention  to  the 
audiences  of  the  five  AM  stations.  We  shall  assume  that  by 
far  the  largest  volume  of  United  Nations  and  international 
news  broadcast  during  the  period  wa3  through  the  regularly 
scheduled  news  broadcasts.  The  times  of  these  broadcasts 
have  been  determined  from  the  daily  radio  schedule  carried 
in  the  newspaper.  (The  content  relevant  to  the  themes  to 
be  studied  will  be  inferred  from  the  front  page  news  of  the 
newspapers  for  the  day.  This  is  described  in  more  detail 
in  Chapter  VII . ) 

2 

This  rating  is  an  average  through  the  months  October, 
1947-February,  1948.  This  peak  rating  occurred  Sunday  after¬ 
noons  at  12:30  and  4:30  P.ti.  See  C.  E.  Hooper,  Inc.  City 
Hooperatings :  Cincinnati,  Ohio,  Fall-Winter  Report:  October, 

194 /-through  February,  1948  (New  York:  C.  E.  Hooper,  Inc.,  1548) . 

^"U.S.  Video  Sets  Now  484,350,"  Variety,  August  25, 

1948,  p.  71  found  one  television  station  and  6,000  televi¬ 
sion  sets  in  circulation  in  the  40-mile  service  area  of 
Cincinnati . 


These  broadcasts  were  usually  fifteen  minutes  in 
length,  but  also  include  the  five  minute  newscasts  inserted 
at  the  beginning  of  the  hour  in  a  continuing  disk  jockey 
program,  and  in  addition,  several  "name"  newscasters  or 
commentators  who  usually  broadcasted  tor  fifteen  minutes 
in  the  evening  including  H.  V.  Kaltenborn,  Lowell  Thomas, 
Edward  R.  Murrow,  Gabriel  Heatter,  Elmer  Davis,  etc.  The 
distribution  of  these  newscasts  by  station,  day  of  the 
week,  and  hour  is  shown  in  the  chart  in  Fig.  IV-1.  The 
chart  is  constructed  from  newspaper  radio  schedules  for 

4 

the  week  of  the  second  through  the  eighth  of  January,  1948. 
Obviously,  such  a  variety  of  news  programs  will  vary  con¬ 
siderably  in  content;  therefore,  the  selection  of  themes 
broadcast  {inferred  from  the  newspaper  content  analysis) 
must  represent  some  sort  of  "average"  content. 


4  Some  ambiguities  exist  in  this  chart,  of  course, 
since  there  are  various  kinds  of  news  broadcasts.  Several 
of  the  men  mentioned  above,  e.g.  H.  V.  Kaltenborn  and  Elmer 
Davis,  commented  upon,  rather  than  reported  the  news.  Their 
commentary  would  not  be  expected  to  follow  \he  daily  (front 
page)  news  as  closely  as  the  reporting  in  an  ordinary  news¬ 
cast.  There  was  also  a  problem  in  identifying  several  of  the 
names;  Joseph  Garretson,  I  believe,  was  a  columnist  for  the 
Cincinnati  Enquirer  and  could  be  expected  to  report  occa¬ 
sionally  on  efforts  of  local  United  Nations  groups;  however, 
Waite  Hoyt,  who  was  for  many  years  the  broadcaster  for  the 
Cincinnati  Reds,  would  not  likely  report  any  relevant  news. 
Similarly,  Dallas  DeWeese  reporting  on  market  news  at  7:30 
A.M.  over  WLW  would  not  be  likely  to  report  any  international 
5r  United  Nations  news.  Finally,  the  newspaper  schedule  is 
constructed  with  15  minute  time  intervals;  therefore,  every 
program  noted  appears  as  at  least  of  a  quarter-hour  length. 
For  convenience,  they  are  all  so  represented  on  the  chart. 
However,  in  the  Hooper  rating  (Hooper,  City  Hooperatings) 
there  is  an  asterisk  throughout  indicating  that  ail  but  four 
of  the  WCKY  newscasts  indicated  on  the  charts,  and  several 
of  the  WCPO  newscasts  were  actually  five-minute  inserts  at 
the  beginning  of  the  hour. 


There  exists  some  independent  validation  of  the  data 
in  the  chart  as  well  as  information  about  the  time  devoted 
to  international  news.  According  to  a  Times-Star  article 
of  November  21,  1947,  the  Xavier  University  International 
Relations  Club,  in  a  week-long  survey  of  the  Cincinnati 
stations,  found  that 

.  .  .  local  stations  were  on  the  air  for  124  hours 
or  7440  minutes  (average  per  station)  during  the  week 
in  question.  There  were  81  newscasts  per  day  for 
1/005  minutes.  International  relations  news  occupied 
224  minutes  daily,  and  United  Nations  news  rated  56 
minutes,  on  the  seven  stations  studied. ^ 

The  chart  in  Fig.  IV-1  (of  5  stations,  not  including  the 

2  FM  stations)  records  442  news  broadcasts  for  the  week  or 

63  per  day.  In  addition,  the  two  FM  stations  combined 

averaged  16  news  broadcasts  daily  for  a  total  of  79  per 

day  for  the  week  of  January  2-8.  This  compares  with  the 

81  newscasts  per  day  found  by  the  Xavier  study.  If  each 

news  broadcast  lasted  15  minutes  except  for  the  eleven  on 

WCKY  and  WCPO  daily,  which  lasted  5  minutes  each,**  then 

the  total  number  of  news  broadcasting  minutes  per  day  was 

approximately  1075  minutes  (cf.  1005  minutes  in  the  Xavier 

survey) .  We  note  also  that  Xavier  survey  implies  ar  average 

of  2.8  minutes  per  broadcast  devoted  to  international 

5 

"Students  Study  Local  United  Nations  Programs , " 
Times-Star,  November  21,  1947,  p.  45. 

^See  footnote  1  above. 


relations  news  and  just  seven-tenths  of  a  minute  average 
to  United  Nations  news. 

The  Definition  of  Vehicles  for  Radio 

How  are  we  to  aggregate  these  many  broadcasts  into 
vehicles  of  the  several  themes?  At  first  the  obvious 
choice,  seemingly  analogous  to  the  definition  of  each  news¬ 
paper  as  a  vehicle  (despite  the  several  editions) ,  would  be 
to  define  the  station  itself  as  the  vehicle.  However,  we 
should  examine  more  closely  the  meaning  of  the  term  vehi¬ 
cle  in  the  context  of  the  simulation. 

What  exactly  is  a  vehicle?  A  vehicle  is  a  frame¬ 
work  for  a  set  of  messages.  The  framework  (although  not 
necessarily  the  particular  set  of  messages)  appears  regu¬ 
larly  and  what  is  more  important,  it  tends  to  have  similar 
(not  completely  random)  audiences,  both  in  size  and  com¬ 
position,  at  each  appearance.  Thus,  an  average  audience 
and  its  breakdown  along  several  dimensions  can  be  defined 
(at  least  in  principle).  In  the  simulation,  each  member 
of  the  population  has  a  fixed  probability  of  exposure  to 
the  vehicle.  The  fluctuations  in  the  audience  size  and 
composition  (and  thus  in  the  emulation  and  repeat  expo¬ 
sure  over  several  time  periods)  fellow  from  these  prob¬ 
abilities,  i.e.,  since  most  of  the  probabilities  are  not 
identically  0.0  or  1.0,  the  variance  of  the  audience  is 
not  0.0.  These  random  fluctuations  will,  however,  in  no 
way  account  for  the  fact  that  a  mid-morning  news  message 
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of  a  given  station  reaches  mostly  women,  while  an  evening 
news  message  of  the  same  station  reaches  almost  equal  num¬ 
bers  of  men  and  women.  But  haven't  we  provided  the  possi¬ 
bility  of  modifying  the  vehicle  exposure  probabilities  with 
message  exposure  probabilities  and  format  weights  at  a 
later  stage  in  the  simulation?  This  is  true,  but  the  effect 
is  not  the  same  as  changing  the  audience  composition  for  the 
vehicle.  The  format  weight  applies  equally  to  each  audience 
type  for  the  particular  message.  The  message  exposure 
probabilities  are  intended  to  vary  for  different  population 

types;  however,  they  are  constant  for  every  message  of  the 
7 

theme.  Now  we  shall  see  that  during  the  day  the  size  of 
the  audience  of  a  station  varies  greatly,  reaching  peaks  at 
8:00  and  9:00  A.M. ,  at  noon  and  at  1:00  P.M.  and  especially 
during  the  evening.  Moreover,  the  composition  of  the  audi¬ 
ence  chang as  as  well;  during  the  day,  women  are  a  much 
greater  proportion  of  the  audience  than  during  the  hours 
before  9:00  A.M.  or  after  6:00  P.M.  It  is  therefore  impos¬ 
sible  for  the  simulation  to  match  the  real  audience,  if  we 
treat  the  station  as  the  vehicle  and  the  several  newscasts 
during  the  day  as  separate  messages  carried  in  the  vehicle. 

There  is  one  other  difficulty  in  the  station-as- 
vehicle  formulation,  namely,  the  duplication.  In  the 

i 

i 

i 

7  ! 

See  the  discussion  of  message  exposure  probabili-  ] 

ties  in  Chapter  VII  for  a  more  technical  explanation  of  S 

the  format  weight  and  message  probability  effects,  1 


simulation,  by  far  the  largest  part  of  duplication  between 
two  vehicle  audiences  is  accounted  for  by  the  cell  means, 
e.g.  if  each  of  two  vehicles  appeals  more  to  the  highly  edu 
cated  than  to  the  less  educated,  the  duplication  will  be 
larger  them  the  random  intersection  of  the  two  average 
audiences,  since  both  vehicles  will  tend  to  have  high  prob¬ 
abilities  of  exposure  in  the  same  subset  of  cells  or 
audience  types.  In  fact,  the  simulation  can  only  make 
small  modifications  from  this  "audience-type-produced" 
duplication  when  this  duplication  departs  from  empirical 

O 

data.  However,  in  the  event  of  two  or  several  newscasts 
at  the  same  time  over  different  stations,  because  of  the 
nature  of  the  medium,  i.e.  the  message  exists  only  for  a 
brief  time  and  then  is  gone,  the  audience  duplication  will 
be  virtually  zero,  even  though  the  audience  characteristics 
of  each  station  be  similar.  Were  we  to  use  stations  as 
vehicles  then,  we  would  find  ourselves  in  the  unhappy  posi¬ 
tion  of  vainly  attempting  to  simulate  nearly  random  dupli¬ 
cation  between  one  pair  of  messages  over  Station  A  and 
Station  B  when  they  occur  at  different  times,  and  then 
zero  duplication  between  another  pair  of  messages  over  the 
same  two  stations  when  they  occur  at  nearly  the  same  time. 

The  solution  to  these  problems  is  to  define  the 
time  period  as  the  vehicle.  It  appears  a  reasonable 

Q 

See  the  discussion  of  duplication  in  Chapter  VII, 
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assumption  that  for  a  given  time  slot  and  day  of  the  week, 
the  audience  total  over  all  stations  is  relatively  constant 
both  in  size  and  composition.  The  steps  in  calculating  the 
audience  distributions  for  these  vehicles  are  more  or  less 
dictated  by  the  (poor)  form  of  the  presentation  of  data 
current  at  the  time.  The  most  widely  distributed  data  are 
program  ratings ,  which  are  actually  average  percentages  of 
the  non-toll-call  telephone  homes  of  a  city  which  have  a 
radio  receiving  the  given  program.  We  shall  make  the 
assumption  that  these  ratings  can  be  projected  to  all  house¬ 
holds  in  the  population  (and,  of  course,  that  the  entire 
population  lives  in  households) .  We  further  assume  that 
in  every  household  only  one  radio  is  playing  at  any  one 
time,  and  then  use  listener  densities,  i.e.  numbers  of 
adult  males  and  females  listening  per  radio  set  playing 
(again,  projected  from  the  sample  of  non-toll  .  .  .  etc.), 
to  calculate  actual  radio  audiences.  Finally,  we  solve 
two  simultaneous  equations  involving  these  audiences  and 
additional  data  to  produce  audiences  which  take  account 
of  the  substantial  listening  outside  of  the  home,  e.g. 
while  visiting  or  while  commuting  in  cars. 

"Eoopexatings" 

The  first  step  is  to  calculate  the  rating  for  news 

g 

for  each  hour  of  each  day.  The  Hooper  ratings  are  available 

9 

For  further  discussion  of  the  method  of  measurement 
and  significance  of  these  ratings,  see  the  Appendix  on  Radio 
Audience  Measurement. 
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as  weekly  averages  for  each  15  minute  time  period  from 
8:00  A.M.  to  7:45  P.M.  These  are  averages  for  the  15  minutes 
over  the  five  weekdays  (Monday  through  Friday)  and  also  over 
the  five  months  October,  1947,  through  February,  1958.  Al¬ 
though  the  radio  audience  increases  from  fall  to  winter,  we 
shall  use  these  average  ratings  over  the  period.  Hooper  pro¬ 
vides  daily  half-hour  ratings  for  the  evening  hours  (from 
8:00  P.M.  to  10:30  P.M.)  but  these  have  also  been  averaged 
(for  news  broadcasts)  to  produce  an  average  weekday  since 
the  ratings  for  the  news  broadcasts  do  not  fluctuate  greatly 
during  the  week. 

As  the  chart  in  Fig.  IV-1  clearly  shows,  the  news 
broadcasts  for  a  given  station  occurred  regularly  through¬ 
out  the  weekdays,  i.e.  if  WKRC  broadcasted  news  at  7:00  A.M. 
Monday,  there  was  usually  a  7:00  A.M.  news  broadcast  on  each 
of  the  other  four  weekdays.  Thus  the  rating  for  one  hour 
of  a  weekday  suffices  for  the  other  weekdays,  since  the 
scheduling  was  quite  constant.  Indeed,  as  was  pointed  out 
above,  the  Hooper  ratings  are  averages  for  the  weekdays 
during  the  daytime  and  early  evening  hours.  Occasionally 
there  occurred  slight  changes  in  scheduling  from  day  to  day: 
sometimes  these  were  ignored  in  calculating  the  news  ratings 
(e.g.  the  7:15  P.M.  news  over  WLW  which  occurred  only  on 
Monday) ,  or  when  possible  the  changes  were  averaged  into 
the  schedule  (e.g.  the  8:30  A.M.  news  over  WKRC  on  Tuesday, 
Wednesday,  and  Thursday  was  considered  equivalent  to  the 
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9:00  A.M.  news  on  Monday  and  Friday  and  a  constant  9:00 
A.M.  WKRC  news  was  added  in  che  ratings) . 

The  news  ratings  for  each  hour  for  the  weekdays 
from  8:00  A.M.  to  12:00  P.M.  (midnight)  are  the  sum  (across 
stations)  of  ratings  for  each  news  broadcast  at  (or  quite 
near)  the  given  hour.  For  example,  the  rating  for  the 
weekday  1:00  P.M.  news  broadcast  is  the  sum  of  ratings 
for  the  following  programs:  15  minutes  of  WKRC  and  WCPO 
news  at  1:00  P.M.,  15  minutes  of  WLW  news  at  12:30  P.M., 
and  5  minutes  of  WCKY  news  at  1:00  P.M.  (WSAI  broadcast 
no  news  at  or  about  this  hour) .  The  ration  ile  for  adding 
these  ratings  is  that  we  have  no  way  of  distinguishing  the 
content  of  one  of  these  news  programs  from  another;  there¬ 
fore,  we  assume  that  everyone  who  listened  at  this  hour 
was  exposed  to  the  same  news  message  and  that  the  dupli¬ 
cation  among  the  audiences  of  the  programs  is  nearly  zero. 
It  seems  unlikely  that  someone  listening  to  the  1:00  P.M. 
WKRC  news  would  also  have  listened  to  any  of  the  other  news 
broadcasts  at  that  time. 

By  this  process  we  have  calculated  the  ratings  for 
news  broadcasts  from  8:00  A.M.  through  10:00  P.M.  Monday 
through  Friday,  from  6:00  P.M.  through  10:00  P.M.  Saturday 
and  from  12:00  A.M.  (noon)  through  10:00  P.M.  Sunday.  How¬ 
ever,  fcr  the  Saturday  daytime  ratings  only  summary  data 
are  provided  by  .ooper.  We  have  estimated  these  ratings 
in  the  following  manner:  the  Hooper  data  show  that  the 


average  percentage  of  "sets-in-use"  during  the  Saturday 
daytime  was  20.2  per  cent,  nearly  equal  to.  the  average  of 
18.3  per  cent  "sets-in-use"  for  weekday  afternoons. ^ 
Therefore,  we  have  simply  used  the  hourly  "s< ts-in-use" 
averages  for  a  weekday  (see  Fig.  IV-2)  with  the  "share  of 
audience"  averages  for  Saturday  and,  of  course,  the  Satur¬ 
day  news  broadcasting  schedule,  to  compute  news  broadcast 
ratings  for  Saturday.  For  the  Sunday  morning  hours  8:00 
A.M.  through  11:00  A.M.  we  have  made  the  same  calculation 
and  used  two-thirds  of  the  resulting  rating  to  account 
for  families  sleeping  later  in  the  morning  and/or  attending 
church,  etc.11 

The  calculations  above  leave  ratings  yet  to  be 
estimated  for  the  morning  hours  6:00  and  7:00  A.M.  and  the 
evening  hours  11:00  P.M.  and  12:00  P.M.  and  1:00  A.M.  The 
graphs  below  in  Figs.  IV-2-7  give  some  indication  of  amount 
of  listening  during  these  hours.  The  first  graph  displays 

10These  data  are  taken  from  Hooper,  City  Hooperat- 
ings,  p.  7. 

11Sjjice  there  was  a  total  of  only  10  news  broad¬ 
casts  during  the  hours  8:00  A.M.  through  11:00  A.M. 

Sunday  morning  (compared  with  15  news  broadcasts  during 
the  same  hours  on  Saturday) ,  these  ratings  are  not  so 
critical. 
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Figure  IV-2.  The  Relative  Ntsaber  of  Seta  In  Use  (Nielsen);  or. 
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WOMEN 


Sunday,  Nov.  3,  1946  -  Champaign  County 
Sunday,  July  27,  1947  -  McLean  County 


Figure  IV- 3»  Pattern  of  Sunday  Listening  by  Urban  Men  and  Women. 


Time  of  Day 

•Sunday,  Nov.  3,  1946  -  Chaapaign  County 
•Sunday,  July  27,  1947  -  McLean  County 


Figure  IV-4.  Pattern  of  Sunday  Listening  by  Fara  Men  and  Wonen 


Percentage  Listening 


Percentage  Listening  Percentage  Listening 


the  Hooper  "sets-in-use"  rating  and  comparable  Nielsen 

12 

Raaio  Inaex  ratings  for  two  periods. 

The  graphs  in  Figs.  IV-3-7  above  display  not  the 
ratings ,  bat  the  actual  percentages  of  men  and  women  lis¬ 
tening,  found  by  studies  in  two  central  Illinois  counties 

in  1946  and  1947,  These  results  are  for  urban  residents, 

13 

for  a  weekday  and  a  Sunday,  in  summer  and  fall.  A  rough 
comparison  of  Sunday  and  weekday  listening  seems  to  sup¬ 
port  these  assumptions:  (1)  that  early  Sunday  listening 
is  much  like  weekday  listening  except  that  it  happens  about 


12 

The  Hooper  ratings  (Hooper,  City  Hooperatmgs) 
are  weekday  averages  for  October,  1947-February ,  1948, 
inclusive.  The  Nielsen  ratings,  cited  in  Sandage,  Radio 
Advertising,  p.  133  are  weekday  averages  for  the  period 
December  27,  1942-January  23,  1943  and  the  period  July  11 
August  7,  1943.  It  should  be  noted  that  the  Hooper  and 
Nielsen  measurements  differ  somewhat,  Hooper  measuring 
average  listening  and  Nielsen  measuring  total  listening. 
Except  possibly  at  the  extreme  ends  of  the  curves.  Hooper 
probably  measures  10-30  per  cent  less  than  the  equivalent 
Nielsen  rating.  See  the  discussion  in  the  "Appendix  on 
Radio  Measurement,"  below. 


^Results  of  the  two  surveys  are  found  in  Charles 
H.  Sandage,  "Qualitative  Analysis  of  Radio  Listening  In  Two 
Central  Illinois  Counties,”  Bureau  of  Economic  and  Business 
Research  Bulletin  Series,  No.  68,  and  University  of  Illinois 
Bulletin,  Vol.  46,  No.  50  (March,  1949TT  The  diary  method 
was  used  to  record  listening.  Every  adult  (18  years  of  age 
or  older'  member  of  a  participating  family  recorded  his 
listening  by  15  minute  periods  from  6:00  A.M.  to  midnight 
for  seven  consecutive  days.  The  families  were  a  cross  sec¬ 
tion  of  families  in  two  central  Illinois  counties  and  pro¬ 
vided  a  sample  of  790  adults  in  McLean  County  and  528  adults 
in  Champaign  County.  Each  of  these  counties  contains  an 


urban  center  of  about  40,000  permanent  residents,  several 
villages,  and  an  extensive  farm  population.  The  Champaign 
County  study  covered  the  week  beginning  November  3,  1946, 
McLean  County  the  week  beginning  July  27,  1947. 
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one  hour  later,  and  (2)  that  late  Sunday  listening  is  much 
like  late  listening  on  other  days.  Finally,  to  get  the 
early  and  late  weekday  ratings ,  we  have  chosen  to  sketch 
in  a  continuation  of  the  Hooper  ratings  in  Figure  IV-2  fol¬ 
lowing  roughly  the  Nielsen  curves.  This  gives  the  following 
sets-in-use  ratings:  6:G0  A.M.-2.0,  7:00  A.M.-8.5,  11:00 
P.M.-18.0,  12:00  P.M.-5.Q,  1:00  A.M.-1.0.  Then  from  the 
ratings,  the  appropriate  "shares  of  the  audience,"  the 
broadcast  schedule,  and  the  assumption  that  Saturday  lis¬ 
tening  is  much  like  weekday  listening,  we  have  completed 
the  news  broadcast  ratings  (Table  IV-1  below) . 

The  next  step  is  to  turn  these  ratings  into  audi¬ 
ences.  For  this  purpose,  we  need  data  on  the  number  of 
listeners  per  listening  household,  i.e,  data  on  listening 
densities,  by  hours  of  the  day  and  days  of  the  week. 

Listening  Densities 

In  addition  to  the  daily  and  hourly  variation  in 
the  number  of  homes  listening  to  radio  (in  the  case  of  the 
Hooper  ratings,  we  have  equated  the  number  of  sets-in-use 
and  the  number  of  households  listening) ,  there  is  also  a 
daily  and  hourly  variation  in  the  number  and  kind  of  lis¬ 
teners  per  households  This  situation  is  illustrated  by 

14 

the  graph  in  Fig.  IV-8.  *  The  audience  and  number  of 

14 

All  the  data  in  this  section  ure  taken  from  Chapter 
5  of  Eugene  F.  Seehafer  and  Jack  W.  Laeremar,  Successful  Radio 
and  Television  Advertising  (New  York:  McGraw-Hill  Book  Co., 
Inc.,  1951).  The  graph  is  found  on  p.  105.  The  authors  cite 
CBS  as  the  source  of  the  data;  however,  no  publication  or 
date  is  provided. 
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Table  IV- 1.  Average  Ratings  for  Hourly  News  Broadcasting 


Days  of  the  Week 


Hour 

Monday-Friday 

Saturday 

Sunday 

6:00  A.M. 

1.6 

1.6 

0.0 

7:00 

8.5 

8.5 

1.2 

8:00 

14.3 

14.0 

6.6 

9:00 

7.2 

11.5 

4.9 

:  .00 

4.5 

11.4 

4.1 

il:00 

14.2 

15.3 

10.0 

12:00 

14.6 

18.4 

22.1 

1:00  P.M. 

16.7 

13.6 

16.1 

2:00 

1.6 

3.6 

2.3 

3:00 

5.7 

12.3 

7.8 

4:00 

7.4 

16.2 

2.9 

5:00 

11.0 

17.6 

3.7 

6:00 

32.0 

30.0 

14.8 

7:00 

33.3 

30.8 

2.0 

8:00 

22.4 

13.0 

3.0 

9:00 

7.1 

6.0 

1.9 

10:00 

.4 

2.2 

0.0 

11:00 

15.5 

15.5 

15.5 

12:00 

4.3 

3.3 

4.3 

1:00  A.M. 

1.0 

1.0 

1.0 

Notes:  These  ratings  were  calculated  from  Hooper  program 

ratings  and  vary  both  with  those  ratings  and  with  the 
schedule  of  news  broadcasts.  The  calculations  are 
described  above. 


Number  Listening 


6  7  8  9  10  1112  1  2  345  6  78  9  10  1112 
P.M.  HjD 

Time  of  Day 


m  Children 


Adolescents 

Women 


Figure  I?-8.  Weekday  Radio  Audience  by  Hours.  The  heavy  darl* 
line  indicates  the  number  of  families  in  the  audience  duriti 
all  hours  of  the  day.  (CBS.) 
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listening  families  of  radio  station  WBBM,  Chicago,  are  de¬ 
picted  for  each  hour  from  6:00  A.M.  to  12:00  P.M.  for  an 
average  weekday*  The  audience  increases  slowly  through  the 
morning  hours,  declines  somewhat  in  the  afternoon,  and  then 
rises  sharply  in  the  evening  until  about  8:30  P.M.;  there¬ 
after  it  declines  sharply  again. 

The  composition  of  the  audience  also  changes. 

During  both  the  very  early  morning  hours  and  the  evening 
hours,  nearly  as  many  men  as  women  are  in  the  audience; 
during  the  work  hours,  9:00  A.M.  to  5:00  P.M.,  the  audience 
is  composed  largely  of  women.  However,  at  no  time  of  the 
day  are  there  as  many  men  as  women  in  the  audience.15 

Equating  the  number  of  families  listening  with  the 
number  cf  sets-in-use,  we  can  then  make  a  rough  calcula¬ 
tion  from  the  graph  of  the  listening  densities  for  men  and 
women  for  each  hour  of  the  day.  These  are  shown  in  Table 
IV-2  below  and  also  plotted  in  Fig.  IV-12. 

In  addition  to  the  data  for  the  single  Chicago  radio 
station,  Seehafer  also  cites  (p.  106-107)  national  densities 
(Tables  IV-3  and  IV-4  below)  from  one  of  the  Hooper  coast- 
to-coast  studies.  The  date  given  is  1949;  however,  it  is 
not  clear  whether  this  refers  to  the  data  of  the  study,  or 


This  general  pattern  was  also  found  in  the  two 
diary  studies  reported  by  Sandage  in  Radio  Listening  in  Two 
Central  Illinois  Counties.  See  the  graphs  in  Figs.  IV-2 


Densities 

(Listeners  per  Set-in-Use) 


Time 


Men 


Women 


Table  IV-3.  Composition  of  the  Radio  Audience  -  Daytime.  (Number  of  listeners  per 
listening  set) 
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Source:  C.  E,  Hooper,  Inc.,  1949 
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of  its  publication,  or  both.  These  data  probably  represent 

a  sample  of  the  non-toll  call  telephone  homes  of  the  thirty 

16 

or  so  largest  U.S.  cities  sometime  during  1949.  Since 
these  are  the  only  listening  density  data  available,  we 
shall  use  them  in  the  audience  estimation;  however,  we 
might  ask  several  questions  about  the  data. 

It  appears  likely  that  the  data  were  collected  at 
least  a  year  after  the  study  period.  This  raises  the  pos¬ 
sibility  that  television  listening  might  have  begun  to 
change  s.he  patterns  of  radio  listening  by  mid-1949.  We 
can  offer  some  evidence  that  that  was  not  the  case:  first, 
there  were  approximately  twenty  times  as  many  radio  as 
television  homes  in  1949  (39,280,000  radio  homes  to 
1,960,000  television  homes);  second,  as  the  table  below 

indicates,  the  average  number  of  hours  listened  per  day  per 

17 

(Nielson)  home  changed  very  litcle  during  the  period. 


^On  pp.  89-90  of  his  Radio  Audience  (1944) , 

Hooper  describes  his  sample  for  such  data  as  "the  tele¬ 
phone  home  population  of  the  thirty-two  large  cities  of 
equal  network  opportunity."  He  does  not  provide  his  age 
break  between  adults  and  children. 

17 

The  radio  datum  is  a  Broadcast  Measurement 
Bureau  estimate  cited  in  Schaefer,  p.  102.  The  television 
estimate,  also  cited  in  Schaefer  (p.  104),  is  from  Broad¬ 
casting,  July  10,  1950.  The  table  is  reproduced  from 
Schaefer,  p.  108.  The  source  is  the  A.  C.  Nielsen  Company. 
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Average  Number  of  Hours  Listened  Per  Day  Per  NRI  Home 
Annual  Average,  1946-1949 


Year 


Hours 


1946  4.0 

1947  4.3 

1948  4.4 

1949  4.2 

If  there  were  no  changes  due  to  the  introduction 
of  television,  were  there  possibly  seasonal  changes  in  lis¬ 
tening  densities?  It  is  generally  assumed,  probably  on 
the  basis  of  ratings,  that  listening  increases  in  the  fall 
and  winter.  The  ratings,  equivalent  to  the  percentage  of 
sets-in-use,  do  indeed  show  seasonal  fluctuations  as  the 
graphs  in  Figs.  IV-9,  IV-10,  and  IV-11  demonstrate.  How¬ 
ever,  strictly  speaking,  the  increase  in  the  ratings  from 
summer  to  winter  does  not  necessarily  imply  an  increase  in 
the  audience;  only  if  the  listening  density  is  assumed  not 
to  decline  very  much  can  we  be  confident  that  the  audience 
has  increased.  The  tacit  assumption  seems  to  be  that  the 
listening  density  is  rather  constant,  and  it  is  upon  this 
assumption  that  we  proceed.  Thus  it  is  consistent  with  all 
the  data  and  rather  plausible  that  the  listening  density 
was  rather  c  .  ’tant  at  least  during  the  period  from  1947  to 
1949. 


The  data  from  these  tables  have  been  plotted  together 
18 

in  Fig.  IV-12.  In  general,  the  shapes  of  the  WBBM  and 


18 

The  Hooper  data  for  weekday  evenings  has  been  aver¬ 
aged  to  produce  a  single  weekday  (Monday-Friday)  evening  plot. 
Also,  since  the  form  is  hourly  averages,  the  values  have  been 
plotted  on  the  half  hour,  e.g..  the  value  for  8:00-9:00  P.M. 
has  been  plotted  at  8:30  P.M. 


Dac  Jan  Feb  Mar  Apr  May  Jun  Jul  Aug  Sep  Oct  Nov 
15,  15, 

1945  1948 

=====1945-46  own  1946-47  ——■1947-48 


Figure  IV-9.  Sets -In-Use  Index  for  Day  tine  (Hon.-Fri. , 
8:00  A.M.-  6:00  (Hooper  Baalc  Audience  Trends) 
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Figure  IV-10. Sets -In-Use  Index  for  Evenings  (Sun. -Sat., 
6:00-10:30  P.M.).  (Hooper  Basic  Audience  Trends) 
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Figure  IV— 1  "LSets-In-Use  Index  for  Evenings  (Sun. -Sat.,  6:00-10:30  P.M.) 
(Hooper  Basic  Audience  Trends) 
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Hooper  curves  for  men  and  women  are  quite  similar?  the  two 
obvious  discrepancies,  for  men  at  12:30  A.M.  and  for  women 
at  5:30  P.M.,  may  be  due  to  peculiarities  of  programming  by 
the  single  station.  At  any  rate,  since  the  Hooper  curves 
are  more  accurate,  and  since  the  Hooper  sample  consists  of 
residents  of  cities,  we  have  in  general  relied  on  the  Hooper 
data. 

However,  the  Hooper  data  are  incomplete,  lacking 

the  listening  densities  for  the  Sunday  morning  hours  and 

for  the  early  morning  and  late  night  hours  for  each  day  of 

the  week.  We  have  completed  the  missing  curves,  generally 

paralleling  the  shapes  of  the  known  WBBM  and  Hooper  data. 

The  reasoning  is  as  follows:  both  the  WBBM  and  the  Sar.dage 

data  indicate  that  although  the  audience  is  relatively 

smaller  during  very  early  morning  and  very  late  evening 

hours,  there  is  a  nearly  equal  number  of  men  and  women  at 

those  times.  Therefore  the  listening  densities  must  be 
19 

nearly  equal.  This  means,  in  general,  that  the  listen¬ 
ing  density  for  women  must  decline  at  the  extremes  of  the 
day,  and  that  for  men  must  increase.  The  rate  of  the  re¬ 
spective  changes  was  judged  from  the  WBBM  curve.  Since 
the  densities  for  men  at  10:30  P.M.  were  approximately 
equal  to  those  at  6:30  A.M. ,  we  have  left  them  nearly  constant 

19 

This  is  obvious,  since  the  listening  density  for 
each  sex  is  just  the  number  listening  of  that  sex  divided 
by  the  total  numbers  of  sets  in  use. 
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from  10:30  P.M.  to  1:00  A.M.  Thus  have  all  the  listening 
densities  been  computed.  The  data  and  method  here  are  ob¬ 
viously  inexact;  however ,  we  believe  that  they  are  as  pre¬ 
cise  as  warranted,  given  the  inaccuracies  in  the  content 

analysis  and  message  exposure  probabilities,  which  are  dealt 
20 

with  later.  The  final  values  of  the  densities  are  pre¬ 
sented  in  Tables  IV-6 ,  IV-7,  and  IV-8  below. 

The  News  Broadcast  Audiences 

We  now  calculate  the  audiences  for  news  broadcasts. 

The  ratings  are  approximately  the  percentages  of  households 

listening.  The  densities  are  the  number  of  male  and  female 

adult  listeners  per  hundred  households.  The  product  of  the 

ratings  and  the  total  number  of  households  gives  the  number 

of  listening  households.  This  number  multiplied  by  the  den- 

21 

si ties  gives  the  number  of  adult  male  and  female  listeners. 


20 

Also,  since  the  audiences  are  calculated  as  the 
product  of  the  ratings  and  densities,  and  since  the  ratings 
are  very  small  in  the  very  early  morning  and  very  late  even¬ 
ing  hours,  the  large  percentage  errors  in  the  densities  should 
not  significantly  change  the  overall  daily  audience  structure. 

21 

Actually,  not  all  adult  males  and  females  lived 
in  households.  According  to  the  1950  census  of  the  Cincin¬ 
nati  Standard  Metropolitan  Area,  within  the  27f. ,715  house¬ 
holds  of  the  area  lived  874,510  of  the  904,402  total  popula¬ 
tion.  The  institutional  population  was  9,297,  leaving  20,595 
members  of  the  population  (904,402  minus  874,510  minus  9,297) 
"at  large"  but  not  living  in  households.  We  have  ignored 
this  discrepancy  in  making  the  calculations. 


Listening  Outside  the  Home 


The  Hooper  data,  upon  which  the  audience  sizes 
have  been  based,  measure  only  the  listening  in  the  home. 
About  1949,  as  it  was  becoming  obvious  that  television 
was  going  to  diminish  the  size  of  the  radio  audience, 
several  studies  were  conducted  to  measure  the  size  of  the 
audience  outside  the  home.  These  studies  found  that  there 
were  indeed  a  significant  number  of  outside  listeners,  of¬ 
ten  numbering  as  much  as  30  or  40  percent  of  the  audience 

22 

at  home.  There  are  no  Cincinnati  or  national  data  among 
these  studies;  therefore  we  have  had  to  adapt  them  for  the 
Cincinnati  audience. 

Table  IV- 5  below  shows  Pulse  data  on  the  hourly 
in-home  and  out-of-home  listening  in  the  New  York  City  area 
during  August  1949. 2 ^ 

Graphs  of  out-of-home  listeners  as  a  percent  of 
in-home  listeners  for  weekdays,  Saturday,  and  Sunday,  are 
shown  in  Fig.  IV-13.  Also,  this  percentage  has  been  extra¬ 
polated  for  each  of  three  curves  to  provide  values  for  the 
early  morning  and  late  evening  hours. 


22 

Several  of  these  studies  are  cited  in  H.  M. 
Beville,  Jr.,  The  True  Dimensions  of  the  Radio  and  Televi- 
sion  Audience  (New  York:  National  Broadcasting  Co.,  Inc., 
1949)  . 


23 


Cited  by  Beville,  True  Dimension,  p.  25. 
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Table  IV-5. 

Comparison  of 

In-Home  and  Out- 

-of-Home  Listening 

Outside 

Listeners 

Listeners 

Listeners 

Out-of 

in 

As  Percent 

Home 

Home 

of  Inside 

Time 

(in  thous.) 

(in  thous.) 

Listeners 

Monday-Friday 

9-10  A.M. 

134.4 

769.9 

17.5% 

10-11 

168.0 

1,165.5 

14.4 

11-12 

134.4 

1,150.2 

11.7 

12-1  P.M. 

190.4 

1,028.1 

18.5 

1-2 

246.4 

838.2 

29.4 

2-3 

436.9 

946.4 

46.2 

3-4 

616.1 

1,051.6 

58.6 

4-5 

369.7 

1,171.8 

31.5 

5-6 

190.4 

1,318.3 

14.4 

6-7 

156.8 

1,649.7 

9.5 

7-8 

67.2 

1,850.0 

3.6  ; 

8-9 

123.2 

1,810.0 

6.8 

i 

Daily  Avg. 

236.2 

1,228.5 

19.2% 

Saturday 

9-10  A.M. 

33.6 

555.9 

6.0% 

10-11 

145.6 

1,197.2 

12.1 

11-12 

89.6 

1,379.6 

6.5 

12-1  P.M. 

156.8 

1,143.6 

13.7 

1-2 

168.0 

975.2 

17.2 

2-3 

369.7 

1,215.9 

30.4 

3-4 

347.3 

1,267.6 

27.4 

4-5 

358.5 

1,391.5 

25.8 

5-6 

235.2 

1,367.7 

17.2 

6-7 

212.8 

1,521.7 

14.0 

7-8 

134.4 

1,489.8 

9.0 

8-9 

358.5 

1,320.1 

27.2 

Daily  Avg. 

217.5 

1,235.5 

17.6% 

Time 


Outside 


Time 

Listeners 

Out-of 

Home 

(in  thous.) 

Listeners 

in 

Home 

(in  thous.) 

Listeners 
As  Percent 
of  Inside 
Listeners 

Sunday 

9-10  A.M. 

33.6 

776.2 

4.3% 

10-11 

89.6 

1,109.8 

8.1 

11-12 

112.0 

1,170.5 

9.6 

12-1  P.M. 

78.4 

1,283.2 

6.1 

1-2 

201.6 

1,100.1 

18.3 

2-3 

369.7 

1,439.3 

25.7 

3-4 

481.7 

1,742. 

27.7 

4-5 

616.1 

2,065.7 

29.8 

5-6 

627.3 

1,984.3 

31.6 

6-7 

403.3 

2,197.1 

18.4 

7-8 

795.3 

2,182.7 

36.4 

8-9 

997.0 

2,397.1 

41.6 

Daily  Avg. 

400.5 

1,620.7 

24.3% 

Source:  The  Pulse,  "Radio  Listening  Out  of  Home  in  New 
York , "  August ,  1949. 


lome  Listeners  for  Weekdays,  Saturdays,  and  Sundays 


In  addition  to  the  data  above,  Beville  also  cites 
some  data  on  out-of-home  listening  bv  sex.  The  Psychology 


cal  Corporation,  in  a  study  of  the  Des  Moines,  Iowa,  and 
Springfield,  Massachusetts, areas ,  found  (not  surprisingly) 
that  males  do  a  much  larger  proportion  of  their  day's  lis¬ 
tening  outside  the  home,  than  do  females.  This  also  holds 

true  for  single  people  compared  with  married  people.  The 

24 

data  are  presented  below. 


Males  Females 

-  -  Total 

F:ngle  Married  Single  Married  Group 


Average  Number  of  Minutes  of  Listening 


Outside  Listening 

56 

30 

19 

17 

27 

Inside  Listening 

111 

161 

192 

258 

196 

Total 

167 

191 

211 

275 

223 

Per  Cent  of  Total 

Day's  Listening 

Outside  Listening 

34% 

16% 

9% 

6% 

12% 

Inside  Listening 

66 

84 

91 

94 

88 

Total 

100% 

100% 

100% 

100% 

100% 

We  have  used  these  data  to 

adjust  the 

radio 

audi- 

ences  to  include  outside 

listening . 

For  each 

hour 

of  the 

p.  25. 


24 


The  data  are  cited  by  Beville, 


True  Dimensions , 


day,  the  total  audience  has  been  increased  by  the  proportion 
given  in  the  graphs  (Fig.  IV-13)  of  the  Pulse  data.  How¬ 
ever,  since  the  Psychological  Corporation  data  show  that  the 
per  cent  of  the  total  day's  outside  listening  for  males  is 
about  three  times  that  for  females  (the  married  males:  16 
per  cent;  married  females:  6  percent;  single  males:  34 
per  cent;  single  females:  9  percent),  we  have  required 

the  proportional  increase  for  males  to  be  three  times  that 
25 

for  females.  This  then  allows  the  correction  to  account 
for  outside  listening  for  each  hour  of  the  day.  The  final 
values  of  the  audiences  are  presented  in  Tables  IV-6,  IV-7, 
and  IV- 8  below. 


25 

The  equations  are  quite  simple.  Let: 

L  =  the  number  of  male  in-home  listeners  at  any  time, 
m 

Lf  =  the  number  of  female  in-home  listeners  at  any  time. 

P  =  the  proportion  by  which  the  male  in-home  listening  is 
m  to  be  increased. 

Pf  =  the  proportion  by  which  the  female  in-home  listening 
is  to  be  increased. 

P  =  the  total  proportion  by  which  the  in-home  listening 
is  to  be  multiplied  to  account  for  outside  listening. 


Since  the  total  increase  must  equal  the  sum  of  the  increase 
in  males  and  the  increase  in  females  we  have: 


l£  =  p 


<Lm  +  Lf> 


Also,  the  proportional  increase  for  males  is  to  be  three 
times  that  for  females,  i.e. 


These  equations  are  then  solved  for  P  and  Pf  at  each  hour 
lo  give  the  proportion  by  which  the  originalrmale  and  female 
listenership  is  to  be  increased  to  accoui.t  for  out-of-home 
listening. 
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Table  IV-8.  News  Broadcast  Ratings  and  Audiences,  and  Listening  Densities 
Sunday,  6:00  AM  -  1:00  AM 
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Radio  Listening  by  Age  and  Education 

We  have  now  calculated  the  total  audience  and  its 
breakdown  by  sex  for  each  of  the  radio  vehicles .  A  pattern 
quite  prominent  in  these  data  shows  women  always  outnumber¬ 
ing  men  in  the  audience.  Because  of  the  division  of  labor 
by  sex  in  the  culture,  women  had  much  more  opportunity  (and 
possibly  greater  need)  to  listen.  E/en  during  the  evening 
hours  when  the  opportunity  to  listen  was  relatively  more 
equal,  the  women  seemed  to  be  more  inclined  to  listen. 

This  interaction  between  listening  and  sex  is  the 
most  important  which  we  have  found.  In  the  appendix  of 
their  autumn,  1947  radio  study,  Lazarsfeld  and  Kendall  re¬ 
port  on  the  relationship  between  listening  and  several 

26 

other  of  our  population  dimensions.  Table  IV- 9  shows 
only  a  slight  association  between  amount  of  evening  listen¬ 
ing  and  age,  i.e. ,  younger  people  seem  to  listen  somewhat 
more.  This  tendency  is  also  found  in  Table  IV- 10  which 


shows  the  proportion  of.  heavy  listeners  by  age  and  education. 

However,  we  find  a  stronger  negative  association 
between  education  and  heavy  listening:  at  every  age  level, 
those  with  college  education  seem  to  listen  less.  Insofar 


as  listening  to  our  "news-on- the-hour"  vehicles  is  explained 


The  three  tables  which  follow  are  taken  from 
Appendix  C,  pp.  132-145  of  Paul  A.  Lazarsfeld  and  Patricia 
L.  Kendall,  Radio  Listening  in  America:  the  People  Look  at 
Radio — Again  (New  York :  Prentice-Hall ,  Inc.,  1948). 
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Table  IV-9.  Amount  of  Evening  Listening  by  Age 


Amount  of  Evening 

Age 

Listening 

21-29 

30-40 

50- 

Less  than  one  hour 

20% 

24% 

27% 

1-3  hours 

48 

49 

45 

3  or  more  hours 

32 

27 

28 

100% 

100% 

100% 

Source:  See  footnote  26 

• 

Table  IV-10.  Proportion 
Education 

of  Heavy 

Listeners'1 

by  Age  and 

Age 

Education 

21-29 

30-49 

50- 

College 

18% 

20% 

17% 

High  School 

36 

28 

28 

Grade  School 

33 

29 

31 

Heavy  listeners  are  those  who  listen  to  the  radio  three  02 
more  hours  on  an  average  weekday  evening. 

Source:  See  footnote  26. 
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more  by  total  listening  time  rather  than  selective  tuning, 
this  negative  association  between  education  and  amount  of 
listening  would  imply  less  news  exposure  for  the  college 
educated.  However,  the  better  educated  do  show  a  decided 
preference  for  both  news  broadcasts  and  public  affairs  pro¬ 
grams.  Table  IV-11  shows  that  at  each  age  level,  those 
with  higher  education  are  more  likely  to  prefer  news  broad¬ 
casts.  Despite  the  lesser  amount  of  listening  by  the  col¬ 
lege  educated,  they  may,  because  of  selective  tuning,  be 
as  much  or  more  exposed  to  news  broadcasting  as  those  of 
lesser  education. 


Table  IV-11.  Proportion  Choosing  News  Broadcasts  Among 
Evening  Program  Preferences  by  Age  and 
Education 


Education 

21-29 

Age 

30-49 

50- 

College 

75% 

78% 

83% 

High  School 

72 

75 

79 

Grade  School 

61 

70 

74 

Source:  See  footnote  26. 


Thus,  the  younger  and  less  well  educated  listen 
more,  but  the  older  and  better  educated  are  more  likely  to 
prefer  news  broadcasts.  On  the  basis  of  this  evidence  and 
the  lack  of  data  specifying  additional  audience  breakdowns 
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for  the  Cincinnati  news  vehicles ,  we  have  used  only  the 
tables  of  audience  by  sex  in  creating  the  simulation  radio 
vehicles. 

Other  Radio  Information 

For  the  purposes  of  this  simulation,  we  will  assume 
that  the  news  broadcast  vehicles  defined  above  carried  all 
the  significant  international  and  United  Nations  radio 
news.  Other  news,  in  the  form  of  special  programming,  e.g., 
information  programs  and  spot  announcements ,  was  broadcast 
over  individual  stations  and  therefore  would  require  defin¬ 
ing  additional  vehicles  if  it  were  to  be  included  in  the 
simulation. 

During  the  course  of  the  educational  campaign, 
several  special  United  Nations  information  programs  were 
presented.  Mr.  Robert  Adair,  the  public  relations  director 
for  the  Cincinnati  Plan,  recalls  that  there  were  four  or 
five  of  these  programs,  probably  broadcast  on  Sunday  after¬ 
noons.  ^  Since  we  have  no  additional  data  about  the  content 
of  these  programs,  and  since  they  account  fox  a  very  small 
proportion  of  the  total  radio  information,  we  have  not  de¬ 
fined  a  special  vehicle  to  include  them  in  the  simulation. 

The  second  known  omission  from  the  -adio  broadcast 
information  are  "spot"  announcements  about  the  United  Nations. 


27 

In  conversation  with  the  author. 
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Information  about  these  "spots*  is  sparse,  consisting  only 

of  passing  references  in  the  final  report  on  the  educational 

campaign  and  in  an  article  on  the  campaign  in  'Phe  New  York 

Times  Magazine.  From  the  report: 

.  .  .  The  radio  stations  broadcast  facts  about  the 
United  Nations ,  one  of  them  scheduling  spot  programs 
one-hundred-and-fifty  times  a  week.  .  .  . 

.  .  .  The  slogan,  "Peace  Begins  with  the  United 
Nations — the  United  Nations  Begins  with  You,”  which 
had  been  broadcast  at  the  end  of  one -minute  "spot" 
announcements  about  the  organization  one-hundred-and- 
fifty  times  a  week,  was  not  recalled  by  fifty-one  per 
cent  of  the  people.  .  .  . 

From  the  magazine  article: 

...  A  hundred  and  fifty  times  a  week  through  January 
and  February  WLW  dropped  in  "spots"  between  its  most 
popular  radio  programs:  "What  is  the  General  Assem¬ 
bly?  ***  It  is  the  Town  Hall  of  the  World."  .  .  .  ° 

In  order  to  model  these  spot  announcements,  we  would 

have  to  define  an  additional  twenty-one  vehicles,  the 

between-program  audiences  of  station  WLW.  In  view  of  the 

effort  required  and  the  evident  limited  amount  of  informa- 

29 

tion  contained  by  these  spots,  we  have  not  included  them 
in  the  simulation  scenarios. 


2  8 

The  two  quotations  from  the  report,  Star  and  Hughes, 
The  Cincinnati  Plan,  are  found  on  pp.  2  and  9.  The  third 
quotation  is  from  Gilbert  Bailey,  "To  Make  Us  Aware,"  New 
York  Times  Magazine,  March  7,  1948,  pp.  24-25. 

29 

Star  and  Hughes  an  The  Cincinnati  Plan,  p.  9, 
judge  that  the  fifty-one  per  cent  who  could  not  recall  the 
slogan  was  surely  an  underestimate: 

.  .  .  Another  ten  per  cent  claimed  that  they  had  heard 
it  but  had  no  idea  when  or  where.  Moreover,  hearing 
does  not  mean  understanding.  One  woman,  questioned  on 
the  slogan,  said:  "Why,  yes.  I  heard  it  over  and  over 
again  .  .  .  but  I  never  did  find  out  what  it  means.  .  .  ." 
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CONSTRUCTING  THE  MEDIA  SYSTEM  III:  GENERATING 
VEHICLE  EXPOSURE  PROBABILITIES  FROM  CUMULATION  DATA 

The  Importance  of  Cumulation 

In  Chapter  III  we  generated  a  population  distrib¬ 
uted  across  the  144  audience  types  defined  by  the  five 
dimensions.  In  Chapters  III  and  IV  we  defined  the  media 
vehicles  and  distributed  the  audience  of  each  vehicle 
across  the  144  audience  types .  The  ratio  of  the  vehicle 
audience  co  the  total  number  of  people  in  an  audience 
type  can  be  considered  a  mean  probability  of  exposure  to 
the  vehicle  for  members  of  the  given  audience  type .  The 
next  step  is  to  assign  probabilities  to  each  member  of 
the  audience  type  so  that  this  overall  mean  exposure  prob¬ 
ability  is  reproduced.  There  is,  obviously,  an  infinite 
number  of  sets  of  probabilities  which  fulfill  this  re¬ 
quirement;  how  shall  we  choose  from  among  these  sets? 

Consider  a  hypothetical  vehicle  which  has  an 
average  audience  of  20  per  cent  of  the  population,  i.e., 
a  mean  probability  of  exposure  of  0.20,  To  simplify 
matters,  let  us  consider  a  population  consisting  of  only 
ten  people.  We  can  show  that  different  sets  of 
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probabilities,  all  of  which  imply  the  same  mean  probabil¬ 
ity  of  0.20,  can  result  in  quite  different  rates  of  growth 
of  the  total  number  of  people  exposed  (the  cumulative 
audience),  or  distributions  of  frequencies  of  exposure. 

In  Table  V-l  below,  we  have  calculated  the  audience  cumu¬ 
lation  throuqh  five  time  periods  for  three  possible  sets 
of  probabilities.  In  the  first  set  each  probability  is 
the  mean  value  0.20;  in  the  second  set  two  probabilities 
have  the  value  1.00  and  the  remaining  eight  are  0.00;  in 
the  final  set,  two  probabilities  have  values  equal  to 
0.60  and  the  other  eight  have  values  of  0.10.  We  observe 
from  the  table  that  each  of  these  models  has  an  average 
audience  of  two  people;  however,  the  audience  cumulation 
is  quite  different.  The  model  for  which  all  the  probabil¬ 
ities  are  equal  to  the  mean  proDability  has  a  high  rate 
of  cumulation;  the  second  model  has  no  cumulation  whatso¬ 
ever — the  same  two  individuals  are  exposed  at  each  issue; 
and  the  third  model  has  a  moderate  rate  of  cumulation 


‘  | 

£ 


lying  between  the  other  two.  Thus,  the  distribution  of 
probabilities  over  the  population  governs  the  kind  of 
exposure  cumulation  (and  frequencies)  produced  by  the 


model. 


In  Appendix  B  we  have  outlined  the  mathematics 


relating  a  particular  model  of  the  population  probabilities 
to  observed  (or  estimated)  cumulation  and  frequency  data. 
The  model  divides  the  population  into  three  groups- -the 
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Table  V-l.  Growth  of  the  Cumulative  Audienc-  in  a  Popu¬ 
lation  of  Ten  Individuals  Through  Five  Time 
Periods  for  Three  Different  Probability  Dis¬ 
tributions  . 


Time 

Period 

Cumulative 

Number  of  Individuals  Exposed  At 
Least  Once 

Ten  0.20 
Probab’ lities 

Eight  0 . 0  and 
two  1 . 0 
Probabilities 

Eight  0.10  and 
two  0.60 
Probabilities 

1 

2.00 

2.00 

2.00 

2 

3.60 

2.00 

3.20 

3 

4.88 

2.00 

4.04 

4 

5.90 

2.00 

4.70 

5 

6.72 

2.00 

5.26 

regular  users  or  subscriber1;  of  the  vehicle,  the  moderate 
or  casual  users  of  the  vehicle,  and  the  very  infrequent 
users  of  the  vehicle.  For  each  of  these  groups,  a  beta- 
function1  distribution  of  probabilities  is  generated  from 
average  audience  and  two-period  cumulation  data  for  that 
group.  In  addition  we  need  to  know  the  size  of  each  of 
the  three  groups. 

Thus,  for  each  vehicle,  we  define  three  distribu¬ 
tions  of  probabilities.  For  each  distribution  (i) ,  we 
need  to  estimate  the  size  (K1) ,  average  audience  (C^1) , 
and  two-period  cumulation  (C21) •  However,  there  are 

1For  a  complete  discussion  of  the  model  and  the 
definition  and  graph  of  the  beta-function,  see  Appendix 
B. 


167 


several  constraints  upon  chese  quantities.  The  three 
distribution  sizes  must  sum  to  the  total  population,  the 
three  average  audiences  to  the  vehicle  audience,  and  the 
three  two-period  cumulations  to  the  vehicle  two-period 
cumulation.  In  addition  a  very  important  constraint, 
arising  from  the  beta  function  model,  relates  the  aver¬ 
age  audience  and  two-period  cumulation  of  each  distribu¬ 
tion:  the  proportion  of  the  distribution  population  in 
the  two-period  cumulation  may  not  exceed  the  difference 
between  twice  the  audience  propor.-on  and  the  square  of 
the  audience  proportion.  For  example,  if  the  average 
audience  is  20  per  cent  of  the  population,  the  maximum 

value  of  the  two-period  cumulation  is  36  per  cent  of  the 

2 

population  £2  ( . 20 )  -  (.20)  ].  This  relation  is  expressed 
algebraically  as: 


If  we  let  P^  represent  the  audience  proportion  and  P2 
represent  the  cumulation  proportion  then  the  equation 
above  becomes 


P2  1  2Pl  -  Px 


A  useful  concept  is  the  relative  two-period  cumulation, 
defined  as  the  ratio  of  C 2  to  C-^  (or  ?2  to  P^)  .  From 
the  equation  above  the  relative  accumulation  is  also 
limited  by  the  average  audience: 


> 
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Relative  twc -period  accumulation  =  P2/P^  —  ^  ”  Pl* 

Since  the  possible  values  of  lie  between  0.00  and  1.00, 
the  ratio  P2/P^  must  between  1.00  and  2.00;  moreover, 
the  larger  the  value  of  P^,  the  smaller  must  be  P2/P^. 

In  other  words,  as  the  average  audience  increases,  the 
number  not  yet  exposed  (who  contribute  to  the  increase 
of  C2  over  C^)  grows  smaller.  Thus  the  maximum  possible 
value  of  the  ratio  C2/C^  decreases.  In  the  limit,  as 
the  average  audience  includes  the  entire  population, 
the  two-period  cumulation  must  just  equal  the  average 
audience  and  the  ratio  C2/C^  is  identically  1.00. 

The  point  where  the  maximu  .  cumulation  is  attained 

is  usually  just  the  case  of  greatest  randomness  (or  least 

2 

structure)  in  the  population.  At  this  point,  all  mem¬ 
bers  of  the  population  have  identical  probabilities, 
equal  to  the  proportion  of  the  population  in  the  average 
audience.  Generally,  the  more  we  can  divide  the  popula¬ 
tion  into  subgroups  (such  as  very  frequent  listeners 
versus  non-listeners)  characterized  by  probabilities 
divergent  from  the  overall  nean  probability  of  exposure — 
that  is,  the  more  structure  we  can  discover  in  the  audi¬ 
ence  patterns,  the  lower  the  resulting  cumulation. 

Cumulation  occurs  when  a  person  not  previously 

2 

This  is  also  the  point  where  the  variance  of  the 
average  audience,  cumulations,  etc.,  is  the  greatest. 
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exposed  to  the  vehicle,  becomes  exposed  to  it.  Therefore 
if  each  issue  of  a  vehicle  ten's  to  expose  the  same  peo¬ 
ple  repeatedly,  the  vehicle  will  tend  to  have  a  lower 
relative  accumulation.  Thus  we  may  expect  that  those 
vehicles  whose  average  audience  consists  in  large  part 
of  persons  with  a  high  probability  of  exposure  should 
have  a  low  relative  accumulation.  An  example  of  this 
might  be  a  magazine  with  a  high  subscription  ratio  or  a 
radio  soap  opera  whose  audience  likely  excludes  nearly 
all  of  the  males  and  cne-half  to  two-thirds  of  the  fe¬ 
males  of  the  population. 

Below  we  have  summarized  these  observations  about 
accumulation  for  this  model: 

1)  The  minimum  value  of  the  two-period  rela¬ 
tive  accumulation  is  1.0.  Its  maximum 
value  is  2.0-P^  where  P.  is  the  audience 
proportion. 

2)  It  appears  likely  both  from  the  moiel  and 
from  the  data  that  ceteris  paribus  the 
larger  the  audience  proportion,  the  smaller 
the  relative  accumulation;  thus  the  audi¬ 
ence  proportion  which  has  already  been 
calculated  for  each  vehicle  acts  as  a 
strong  constraint  on  the  other  variables. 

3)  The  larger  the  proportion  of  the  population 


in  the  high  probability  distribution,  the 
smaller  the  relative  accumulation. 


We  turn  now  to  the  actual  calculation  of  K,  C^,  and 
C2  for  each  distribution  of  each  vehicle. 

Distributions  and  Cumulation  for  Newspapers 

We  first  estimate  tVj  proportion  of  the  population 
in  each  subscriber  (high  pro- Ability)  distribution.  Table 
V-2  below,  taken  from  the  ABC  Audit  Report  for  the  Post 
shows  the  distribution  of  copies  by  edition  and  region. 
From  conversation  with  the  Post  circulation  managers,  we 
have  estimated  the  following  distribution  pattern  for 
each  of  the  editions: 


Of  Those  in 


Edition 

Distribution 

Percent  in 
City  Zone 

City  Zone, 
Per  Cent  to 

Subscribers 

1st 

all  street  sales 

83% 

0% 

2nd 

mostly  suburban  areas  7 

0 

3rd 

Kentucky :  1/4  to 

streets,  3/4  to 
homes 

82 

75 

4th 

mostly  city  homes 

100 

100 

5th 

1/2  to  city  homes, 
1/2  to  streets 

98 

50 

6th 

all  street  sales 

98 

0 

Predate 

mostly  outside  city 

1 

0 
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The  product  of  the  site  of  each  edition,  the  per  cent  dis¬ 
tributed  in  the  city  zone,  and  the  proportion  of  city  zone 
copies  going  to  subscribers,  gives  the  number  of  copies 
going  to  subscribers.  This  number  ia  approximately  80,000 
copies  or  64  per  cent  of  city  zone  sales.  We  have  used 
this  proportion  for  each  of  the  daily  newspapers.  If  we 
assume  that  this  is  also  the  number  of  subscriber  house- 
nolds  and  use  the  previously  calculated  estimate  of  2.23 
adults  per  household  (see  p.110  above)  we  can  calculate 
the  number  of  persons  in  the  subscriber  distributions  for 
each  paper: 

Calculation  of  the  Number  of  Newspaper  Subscribers 


Newspapers 


Post 

Times -Star 

Daily  Enquirer 

City  Zone 
Circulation 

125,008 

4 

132,441 

103,500 

@64%  into 

subscriber 

households 

80,000 

hones 

84,700 

homes 

66,300 

homes 

@2.23  adults 
per  household 

178,000 

adults 

189,000 

adults 

148,000 

adults 

%  of  all  adults 
in  subscriber 

29% 

31% 

24% 

distribution 

Since  the  number  of  adult  readers  per  copy  is  about 
3 

2.29  but  the  number  of  adults  per  household  is  at  most 

3 

This  was  calculated  'above  from  the  corrected  NORC 
survey  data. 
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2.23,  the  mean  exposure  probability  for  each  of  these 
adults  must  be  rather  high.  We  have  chosen  a  mean  proba¬ 
bility  of  .95  in  each  case.  The  audiences  thus  produced 
are  shown  below: 

Newspaper  Audiences  in  the  High 
Probability  Distribution 


Post 


_ Newspapers _ 

Times-Star  Daily  Enquirer 


Size  o'*  High 
Distribution 

178,000 

189,000 

148,000 

(%  of  Popula¬ 
tion) 

29% 

31% 

24% 

Mean  Exposure 
Probability 

.95 

.35 

.95 

Average  Audience 
of  the  High  Dis¬ 
tribution  (C^ ) 

169,000 

179,500 

140,500 

(%  of  Popula¬ 
tion) 

27.5% 

29.5% 

22.8% 

A  1961  national  study  of  newspaper  readership  found 
that  on  an  average  weekday,  74.2  per  cent  of  the  popula¬ 
tion  were  pri nary  readers  of  daily  newspapers.  Primary 
readers  were  defined  as  tho.°^  readers  who  received  the 
newspaper  in  their  homes  or  bought  it  directly  from  a 

4 

Audits  &  Surveys  Company,  Inc.,  A  National  Study  of 
Newspaper  Reading,  Vol.  1,  p.  15.  r*..ie  3a t a  are*  for  Metro¬ 
politan  areas  of  500,000  or  more  inhabitants.  The  data 
for  Sunday  readership  cited  in  the  next  paragraphs  sure  found 
on  page  46  of  the  same  volume. 
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newsstand;  thus,  primary  readers  included  all  readers  ex¬ 
cept  passalong  readers.  From  our  estimates  above  of  the 
proportion  of  street  sales  (36  per  cent)  and  home  deliv¬ 
eries  (64  per  cent) ,  we  would  expect  about  two-thirds  of 
the  primary  readers  or  about  50  per  cent  (two-thirds  of 
74.2  per  cent)  of  the  population  to  receive  their  copies 
via  home  delivery.  Remarkably,  if  we  assume  random  du¬ 
plication  between  newspapers,  these  three  average  audi¬ 
ences  imply  a  net  average  subscriber  audience  of  50.5 
per  cent  of  the  population.  This  lends  some  confidence 
to  our  estimates. 

For  the  Sunday  paper,  we  have  guessed  that  a  much 
higher  percentage  of  the  copies  are  delivered  to  house¬ 
holds.  Whereas  we  estimated  that  64  per  cent  of  the 
daily  papers  were  delivered  to  households,  we  shall  set 
85  per  cent  as  the  proportion  of  Sunday  papers  delivered 
to  households.  Using  2.23  adults  per  household,  this 
gives  341,000  adults  (180,  150  copies  times  .85  times 
2.23)  or  55.2  per  cent  of  the  population  in  the  sub¬ 
scriber  (high  probability)  distribution.  Since  this  is 
such  a  large  proportion  of  the  total  population,  it  seems 
unlikely  that  the  mean  exposure  probability  will  be  as 
large  as  for  the  daily  papers  (.95).  Therefore,  we  have 
used  a  mean  probability  of  .85. 


The  Low  Probability  Distribution 

We  have  no  information  about  the  number  of  people 
for  each  newspaper  who  are  almost  never  exposed  to  the 
particular  newspaper.  Therefore ,  we  have  arbitrarily 
assumed  that  one-half  of  those  not  in  the  average  audi¬ 
ence  of  a  paper  are  in  this  low  distribution  and  that 
its  mean  probability  is  0.05.  This  implies  that  these 
infrequent  readers  of  a  given  newspaper  see  an  average 
of  one  of  twenty  issues.  We  have  calculated  previously 
the  following  total  average  audience  values  for  each 
newspaper: 

Post 
293,900 
47.6% 

Given  the  size  and  mean  .probability  for  both  the  high 
and  low  distribtuions ,  the  size  of  the  middle  distribu¬ 
tion  is  just  the  remainder  of  the  population  and  its 
mean  probability  is  just  the  remainder  of  the  audience 
divided  by  the  size.  These  results  are  summarized  in 
Table  V-3  below. 


Newspaper  Average  Audience 

Tiroes- Star  Enquirer  (daily)  Enquirer  (Sunday) 
309,300  242,100  369,900 

50.1%  39.2%  64.3% 


"*r?£***J &C02Z33G!#  '»*»■»,- 
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From  the  size  and  mean  probability  for  each  distri¬ 
bution/  we  can  calculate  the  maximum  value  of  the  relative 
accumulation  (CjVc^1)  for  each  and  the  total  for  the 
newspaper r  This  maximum  value  for  the  population  struc¬ 
tured  by  the  three  distributions  can  be  compared  with 
the  maximum  implied  simply  by  the  total  audience/  i.e., 
the  maximum  relative  accumulation  (C2/C^)  of  the  same 
total  audience  for  an  unstructured  population.  These 
values  are  shown  in  Tables  V-4  and  V-5. 

Table  V-4  Maximum  Two-Period  Cumulative  Audiences  for 
Each  Newspaper  Distribution 


Distribution 

Maximum  Two-Period  Cumulation  as  a  Per¬ 
centage  of  the  Total  Population  for 

Each  Distribution 

Post 

Times -Star 

Enquirer 

"(Dally) 

Enquirer 

(Sunday) 

High 

28.90% 

30.90% 

23.95% 

53.75% 

Middle 

29.70 

29.96 

24.90 

23.15 

Low 

2.55 

2.44 

2.96 

1.74 

Total 

51.15% 

63.30% 

51.81% 

78.64% 

Thus  we  observe  that  the  simple  structuring  of  the 
population  into  three  distributions  with  three  mean 
probabilities  has  significantly  decreased  the  maximum 
relative  accumulation.  Comp  curing  the  cumulation 
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Table  V-5.  Comparison  of  the  Maximum  Two-Period  Cumula¬ 
tive  Newspaper  Audiences  under  the  One-  and 
Three-Beta  Function  Models. 


I 


Maximum  Relative 

Accumulation 

<Vci> 

Post 

Times -Star 

Enquirer 

■  ToaiTyT 

Enquirer 

(Sunday) 

3-Distribution 

Case 

1.284 

1.264 

1.322 

1.224 

1-Distribution 

Case 

1.5  £4 

1.499 

1.608 

1.357 

percentages  above  with  the  audience  percentages  in  Table 
V-3,  we  observe  that  only  in  the  middle  distribution  is 
there  a  significant  increase  in  exposure  possible.  The 
high  distribution  population  has  been  almost  completely 
(85  or  95  per  cent)  exposed  by  the  first  issue,  and  ex¬ 
posure  is  so  small  (5  per  cent)  for  each  issue  in  the 
low  distribution  that  it  fails  to  accumulate  significant¬ 
ly;  thus,  most  of  the  possible  cumulation  is  restricted 
to  the  one-quarter  to  one-half  of  the  population  in  the 
middle  distribution.  In  matters  little  what  figures  cure 
chosen  for  the  two-period  emulations  in  the  low  and  high 
distributions;  the  significant  figure  is  tl  at  of  the 
middle  distribution. 

We  have  only  two  items  of  data  relating  to  newspaper 
cumulation,  both  from  the  1961  newspaper  readership 
study.5  For  all  the  daily  newspapers  (with  a  total 

5 See  reference,  note  >. 
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average  audience  of  79.7  per  cent  of  the  population),  the 
relative  accumulation  value  is  1.055.  This  value  is 
quite  low,  but  then  the  average  audience  is  extremely 
high.  No  single  newspaper  has  so  high  an  average  audi¬ 
ence.  He  must  assume  that,  at  best,  this  figure  .s  an 
extreme  lower  bound  for  the  relative  accumulation.  For 
the  Sunday  newspapers  (74.60  per  cent  average  audience) 
the  relative  accumulation  is  1.045.  Since  the  average 
audience  of  the  Sunday  Enquirer  is  somewhat  lower  (64.30 
per  cent)  than  the  combined  newspapers  of  the  study,  we 
expect  the  Sunday  Enquirer 1 s  relative  accumulation  to  be 
•omewhat  higher.  With  little  guidance  from  any  data,  we 
have  arbitrarily  set  the  two-period  cumulation  for  each 
distribution  at  the  midpoint  between  the  average  audi¬ 
ence's  minimum  value  and  the  maximum  possible  cumulation. 
Thus,  the  second  issue  exposes  one-balf  of  those  who 
would  be  newly  exposed  if  exposure  were  completely  ran¬ 
dom  from  issue  to  issue  within  the  distribution.  Table 
V-6  displays  the  resulting  cumulations. 

Distributions  and  Cumulation  for  Radio 

He  consider  first  the  weekday  radio  broadcasts.  The 
audience  data  calculated  below  (Table  V-6)  and  a  consid¬ 
eration  of  tkw  regularity  of  living  patterns  during  the 
hours  from  7:00  A.M.  through  5:00  P.H.  seem  to  invite  the 
common  treatment  of  news  broadcast  audiences  during  these 
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hours.  The  percentage  of  men  in  the  radio  audience  at 
these  times  varies  only  from  1.48  per  cent  to  7.19  per 
cent;  thus  a  large  porportion  of  the  men  are  rarely  if  at 
all  in  these  radio  audiences.  For  the  women  the  range  is 
somewhat  larger,  from  4.62  per  cent  to  17.77  per  cent, 
and  the  values  somewhat  higher  than  those  fo~~  he  men, 
as  might  be  expected.  The  total  audiences  during  this 
time  vary  from  3.14  to  12 , 42  per  cent  of  the  population. 
Therefore,  we  have  decided  to  treat  news  broadcasts  at 
these  hours  as  a  group. 

The  Male  Audiences 

During  the  hours  from  7:00  A.M.  through  5:00  P.M. , 
the  audience  data  of  Table  V-6  shows  that  not  more  than 
7.19  per  cent  of  the  men  cure  ever  in  the  news  broadcast 
audience.  At  8:00  A.M.  when  every  station  is  carrying 
news  and  the  male  listening  density  is  still  moderately 
high,  the  maximum  value  audience  of  the  period  is  reached 
at  7.19  per  cent.  We  can  assume  that  at  any  of  these 
hours  on  any  given  day  the  proportion  of  males  not  in 
the  radio  audience  at  all  is  quite  large.  (Note  that 
the  audience  figures  include  a  correction  for  listening 
outside  the  home.)  Thus  we  have  assigned  the  men  to  dis¬ 
tributions  in  the  following  manner: 

1.0  per  cent  of  the  men  are  assigned  to  a  high  proba¬ 
bility  distribution  with  mean  probability  of  0.70. 


182 


29.0  per  cent  of  the  men  are  assigned  to  a  middle 
probability  distribtuion  with  a  mean  probability  deter¬ 
mined  by  the  average  audience  of  the  vehicle. 

70.0  per  cent  of  the  men  are  assigned  to  a  low  proba¬ 
bility  distribution  with  a  mean  probability  of  0.04. 

The  Female  Audiences 

For  these  daytime  hours  we  do  have  some  data  about 
women's  listening  habits.  Lazar sf eld,  in  a  study  of  women 
residents  of  New  York,  Cleveland,  Chicago  and  Kansas  City 
conducted  in  the  late  1940's,  found  the  following  distri- 

g 

bution  of  listenerships  among  the  women: 


Inaccessibles 

34% 

Non-Listeners 

24 

Story  Audience 

19 

Other  Listeners 

23 

100% 

According  to  these  figures  fifty-eight  per  cent  of  the 
women  very  rarely  listen  during  these  hours.  The  other 
forty-two  per  cent  do  listen.  In  choosing  the  distribu¬ 
tion  sizes  for  women,  we  have  also  noted  that  the  minimum 

6The  study  is  reported  in  Paul  F.  Lazarsfeld  and  Frank 
N.  Stanton,  eds..  Communications  Research  1948-1949,  (New 
York:  Harper  &  Brothers,  1^50) ,  p  76-85.  Although  these 
terms  were  originally  used  to  describe  the  morning  listening 
habits  of  the  women,  we  shall  assume  that  these  hold  true 
also  for  the  afternoon.  Lazarsfeld  indicates  in  the  study 
(p.  81)  that  these  listening  differences  cure  strongly  main¬ 
tained  in  the  afternoon  and  even  to  some  degree  into  the 
evening . 


'  / 
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audience  for  women  is  as  low  as  4.62  per  cent.  The  women 

7 

then  are  to  be  distributed  as  follows: 

3.0  per  cent  of  the  women  are  assigned  to  a  high 
probability  distribution  with  mean  probability  of  0.70. 

39.0  per  cent  of  the  women  are  assigned  to  a  middle 
probability  distribution  with  a  mean  probability  deter¬ 
mined  by  the  average  audience  of  the  vehicle. 

58.0  per  cent  of  the  women  are  assigned  to  a  low 
probability  distribution  with  a  mean  probability  of  0.04. 

Recalling  that  the  composition  of  the  population  is 
47.8  per  cent  male  and  52.7  per  cent  female,  we  may  now 
produce  the  three  total  distributions  of  listeners  and 
the  average  audiences  accounted  for  by  those  of  known 
mean  probability: 


Size  of 
Distribution 

Mean 

Probability 

Average 

Audience 

High  Distribution 

12,700 

2.05%a 

0.70 

8,300 

1.44%a 

Middle  Distribution 

211,800 

34.27% 

To  be  determined  from 
the  individual  audience 
vehicles 

Low  Distribution 

393,000 

63.68% 

0.04 

15,700 

2.55% 

^he  percentages  are  based  on  the  total  population. 

70f  course,  the  generation  and  assignment  of  probabil¬ 
ities  won’t  strictly  follow  these  male  and  female  distri¬ 
butions;  only  three  distributions  are  generated  by  the 
cumulation  routine.  We  are  using  this  dichotomy  (which 
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The  average  audience  accounted  for  by  the  high  and 
low  probability  distribution  is  24,600  or  3.99  per  cent 
of  the  population.  For  three  of  the  vehicles  (the  9:00 
A.M. ,  10:00  A.M. ,  and  3:00  P.M.  news  broadcasts)  this 
partial  audience  is  larger  than  the  total  vehicle  audience 
For  these  three  vehicles  we  have  lowered  the  mean  proba¬ 
bility  of  the  high  distribution  to  0.5  and  vhe  low  distri¬ 
bution  to  0.02.  The  calculated  means  are  presented  in 
Table  V-7. 

As  with  the  newspaper  data  above  we  have  arbitrarily 
set  the  two-period  cumulation  for  each  distribution  at 
the  mid-point  between  its  minimum  and  maximum  possible 
values.  Comparing  the  resulting  relative  accumulations 
with  those  of  the  four  radio  programs  measured  by  Politz 
(Table  B-l)  we  observe  that  the  calculated  relative  ac¬ 
cumulations  are  slightly  lower.  However,  the  evening 
programs  (those  measured  by  Politz)  have  a  potential 
audience  of  most  of  the  population;  the  weekday  morning 
and  afternoon  programs,  however,  probably  exclude  nearly 
all  of  the  males  from  the  potential  audience.  Thus  the 
relative  accumulation  should  be  slightly  lower.  The 
final  values  are  shown  in  Table  V-8. 

will  shortly  be  united)  dimply  as  a  heuristic  to  dis¬ 
cover  the  forms  of  the  three  general  distributions. 


Table  V-7.  Mor.day-Friday  Morning  and  Afternoon  Audience  Distributions  and  Mean 
Probabilities 
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Evening  Listeninc 


For  the  weekday  evening  listening  distributions,  we 
can  find  no  better  data  than  that  of  an  NORC  national 

O 

survey  of  fall,  1947.  Respondents  were  asked  to  esti¬ 
mate  the  total  number  of  hours  that  they  spent  listening 
in  the  morning,  in  the  afternoon,  and  in  the  evening. 

Rearranging  these  figures  for  evening  listening,  we  find 

g 

the  following  distribution  of  listening: 


Percentage  of 
the  Population 

13.2% 


Average  Probability 
of  Listening 

.000 


.025 


.076 


14.6 


.150 


24.7 


.300 


20.1 


.492 


14.6 


.700 


100.0% 


.900 


1.000 


Paul  A.  Lazarsfeid  and  Patricia  L.  Kendall,  Radio 
Listening  in  America:  The  People  Look  at  Rad.io-Again 


(New  York;  Prentice-Ealx ,  Inc.,  1948),  p. 

g 

Assuming  that  most  morning  listening  occurs  in  the 
hours  from  7:00  A.M.  to  noon,  the  afternoon  listening  in 
the  hours  from  noon  to  6:00  P.M.  and  most  evening  listen¬ 
ing  in  the  five  hours  from  6:00  P.M.  to  11:00  P.M.,  we  can 
convert  the  respondent’s  listening  estimates  into 
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It  is  very  difficult  to  use  this  distribution  directly  for 
our  purposes  since  the  listening  described  is  an  average 
over  all  broadcasts  at  a  given  time,  not  simply  news  broad¬ 
casts.  (The  average  audience  for  weekday  evening  news 
broadcasts  is  never  larger  than  27  per  cent  of  the  popu¬ 
lation,  because  at  the  peak  listening  hours,  one  or  more 
of  the  radio  stations  is  not  broadcasting  news.) 

The  data  indicate  that  18  per  cent  of  the  population 
rarely  listen  to  anything  during  the  evening,  about  22 
per  cent  cure  constantly  listening  to  something ,  while  the 
remaining  60  per  cent  listen  occasionally  to  something. 
Therefore,  for  the  evening  news  broadcast  with  the  largest 
audience  (27.24  per  cent) ,  we  have  chosen  to  make  the  low 
probability  (0.03)  distribution  25  per  cent  of  the  popu¬ 
lation.  The  corresponding  high  probability  (0.75)  dis¬ 
tribution  will  equal  20  per  cent  of  the  population.  For 
the  other  vehicles,  the  lower  the  average  audience,  the 
larger  the  low  distribution  and  the  smaller  the  high  dis¬ 
tribution  from  this  base  line  thus  established.  Table 
V-9  shows  the  resulting  distribution  sizes  and  mean  prob¬ 
abilities. 

probabilities  of  listening  at  any  time.  This  procedure 
gives  average  audiences  of  19,  15  and  38  per  cent  for  the 
morning,  afternoon  and  evening  hours,  respectively.  The 
Hooper  average  ratings  for  Cincinnati  during  these  hours 
were  18.3,  22.8  ar  38.1  per  cent  respectively.  There¬ 
fore,  we  have  some  confidence  in  these  personal  estimates. 


at  all  at  thia  hour,  or  baoauaa  Caw  nawa  broadoaata  vara  achalulad,  or  both 


For  the  two-period  cululations  of  each  distribution 
we  have  used  the  procedures  as  described  above  for  the 
daytime  audiences/  with  the  exception  that  three-fourths 
rather  than  one-half  of  the  maximum  additional  exposure 
was  allowed.  This  made  the  values  of  the  relative  two- 
period  cumulation  comparable  to  the  Politz  radio  data 
cited  below  (p.  497).  The  resulting  values  are  tabulated 
below.  (See  Table  V-10.) 

Cumulations  for  Saturday  and  Sunday 

In  estimating  the  distribution  sizes,  mean  probabil¬ 
ities,  and  two-period  cumulations  for  these  days,  we  have 
made  the  following  assumptions: 

1.  We  have  assumed  that  Saturday  mornings  ax i  like  the 
weekday  mornings  already  calculated,  i.e  ,  most  of 
the  men  and  a  large  proportion  of  the  women  are  not 
often  available  for  listening. 

2.  The  remaining  Saturday  and  Sunday  vehicles  are  like 
weekday  evenings,  i.e.,  the  men  cure  available  and  the 
programs  have  relatively  high  relative  two-period 
cumulations . 

We  note  that  for  each  week  in  the  time  period  of  the 
simulation,  each  weekday  vehicle  will  run  five  times  com¬ 
pared  to  one  run  of  each  weekend  vehicle.  Thus,  the 
weeku.'.y  vehicles  should  have  a  much  more  important  effect 
on  exposure.  The  weekend  vehicle  data  are  shown  in  Tables 


V-ll  to  V-14  below. 


Tabl«  V-10.  Monday-Prlday  Evaning  Radio  Two -Period  Cuaulativa  Audianoa  by  Diatribution 
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Table  V-ll.  Distribution  Sizes  for  Saturday  Morning  News 
Broadcasts 


Distribution 


Distribution 

Size 


Per  Cent  of 
Population 


High  12,700  2.05% 

Middle  211,800  34.27 
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Low 


393,000 


63.68 
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The  news  broadcast  audience  at  this  hour  is  insignificantly  small,  either  because 
there  were  few  listeners  at  all  at  this  hour,  or  because  few  news  broadcasts  were 
schedules,  or  both. 
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aTh«  news  broadcast  audience  at  this  hour  is  insignificantly  small,  either  be¬ 
cause  there  were  few  listeners  at  all  at  this  hour,  or  because  few  news  broad¬ 
casts  were  scheduled,  or  both. 
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The  news  broadcast  audience  at  this  hour  is  insignificantly  small,  either  be¬ 
cause  there  were  few  listeners  at  all  at  this  hour,  or  because  few  news  broad¬ 
casts  were  scheduled,  or  both. 


Table  v-16 .  Sunday  Mawa  Broadcast  Two-Period  Cumulative  Audiences 
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broadcast  audience  at  this  hour  is  insignificantly  amall,  either  because  there  were  few  listeners 
>  this  hour,  or  because  few  news  broadcasts  ware  scheduled,  or  both. 


CHAPTER  VI 


CONSTRUCTING  THE  MEDIA  SYSTEM  IV:  THE  AUDIENCE 
DUPLICATIONS  AND  ASSIGNMENT  OF  EXPOSURE  PROBABILITIES 

Allocating  Probabilities  to  Population  Subgroups 

In  Chapter  I  we  introduced  the  basic  ideas  relating 
to  the  allocation  of  the  probabilities  generated  from 
the  cumulation  statistics  to  the  cells  defining  the  popu¬ 
lation  subgroups.  Before  we  present  the  treatment  of 
duplication  in  the  model ,  we  need  to  describe  in  detail 
this  allocation  procedure. 

In  the  first  link  of  the  simulation,  we  generated  the 
simulation  population  and  the  audiences  for  each  communi¬ 
cation  vehicle  distributed  over  the  population  types  de¬ 
fined  by  the  population  dimensions.  The  ratio  of  the  audi¬ 
ence  in  a  cell  to  the  population  of  that  cell  gave  a  re¬ 
sulting  mean  probability  of  exposure  for  the  particular 
cell.  In  the  second  link  of  the  simulation,  we  introduced, 
for  each  vehicle,  the  cumulation  information  (described  in 
Chapter  V),  for  each  of  three  distributions.  Then,  for 
each  distribution,  we  calculated  the  parameters  of  the 
beta  function  and  created  a  number  of  probabilities  equal 
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to  the  distribution  size  and  distributed  at  intervals  from 
0.0  to  1.0  in  approximately  the  density  of  the  beta  func¬ 
tion.  (Of  course,  the  discreet  probabilities  do  not  ex¬ 
actly  duplicate  the  beta  function?  therefore,  if  the 
average  audience  is  slightly  different  from  that  required, 
the  simulation  makes  a  slight  correction  in  these  discreet 
probabilities.)  Finally,  for  each  vehicle  we  have  three 
distributions  of  discreet  probabilities,  the  number  of 
probabilities  in  each  distribution  equal  to  the  number  of 
simulated  persons  in  that  distribution,  and  the  probabil¬ 
ities  chosen  in  such  a  way  as  to  reproduce  the  cumulation 
data  for  each  distribution  and  for  the  vehicle  as  a  whole. 
The  next  step  is  to  allocate  these  probabilities  to  the 
cells  of  the  simulation  population  in  such  a  way  as  to 
reproduce  each  cell's  average  audience  for  the  vehicle. 

How  shall  we  decide  whicn  probabilities  are  allocated 
to  which  cells?  First,  the  number  allocated  to  any  cell 
must  equal  the  population  size  for  the  cell.  Second,  the 
average  of  the  allocated  probabilities  should  ecr.ial  the 
vehicle  average  audience  in  the  cell.  I*  we  have  only 
two  distributions  of  probabilities,  if  we  draw  randomly 
without  replacement  from  the  distributions,  and  if  we  are 
willing  to  modify  the  second  condition  such  that  the 
expected  value  (over  many  draws)  of  the  average  of  the 


allocated  probabilities  be  equal  to  the  cell  mean,  then  we 
can  specify  for  each  cell  the  exact  number  of  probabilities 
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to  be  drawn  randomly  from  the  high  and  low  distributions. 
However,  if  we  have  three  distributions,  we  need  another 
condition  to  specify  exactly  the  number  of  probabilities 
for  each  cell  to  be  drawn  from  each  distribution.  We  have 
actually  specified  a  function  which  relates  the  cell  mean, 
the  average  of  the  probabi lities  of  each  of  the  three  dis¬ 
tributions,  and  the  number  of  probabilities  to  be  drawn 
from  each  distribution,  in  order  to  provide  the  third  con¬ 
dition  to  solve  the  equations.^-  However,  this  method  of 
allocating  probabilities,  while  sufficiently  accurate  for 
cells  with  a  large  number  of  probabilities,  (in  that  it 
will  closely  reproduce  the  cell  mean) ,  is  not  accurate 
enough  for  small  cells.  Therefore,  we  have  developed 
another  method  for  allocating  probabilities  which  takes 
account  of  the  cell  mean  to  be  reproduced  after  each  draw 
of  a  probability.  This  method  is  used  in  the  simulation 
as  presently  programmed;  it  is  described  in  the  succeeding 
paragraphs . 

Let  us  call  the  three  distributions  of  probabi lities 
produced  for  each  vehicle  from  the  cumulation  data  the  low, 
the  middle,  and  the  high  probability  distributions.  Of 

^This  solution  to  the  problem  is  discussed  in  detail 
in  John  P.  Kramer,  The  Three  Distributions  Case :  How  to 
Choose  the  Proper  Number  of  Probabilities  for  Each  Distri¬ 
bution  When  there  are  Three  Distributions,  COMCOM/Simula- 
iion  Memo  #33,  January  27,  1966.  It  turns  out  that  any 
function  lying  within  a  specified  convex  set  and  passing 
through  two  of  the  comers  of  that  set  will  provide  a 
satisfactory  solution. 


course,  the  probabilities  in  each  of  these  distributions 
may  range  from  0.0  to  1.0.  However,  the  average  value  of 
the  probabilities  in  the  low  distribution  will  be  the 
lowest  of  all  the  distributions  with  correspondingly 
higher  average  probabilities  for  the  other  two  distribu¬ 
tions.  He  define  boundaries  between  the  distributions  at 
the  following  points:  the  boundary  between  the  low  and 
middle  distributions  i s  the  average  of  the  mean  probabil¬ 
ity  of  the  low  distribution  and  the  mean  probability  of 
the  middle  distribution;  the  boundary  between  the  middle 
and  high  distributions  is  the  average  of  the  mean  proba¬ 
bility  of  the  middle  distribution  and  the  mean  probability 
of  the  high  distribution.  Now  we  look  at  the  first  cell 
in  the  simulation.  The  cell  mean  for  this  first  cell 
falls  in  the  range,  as  defined  by  these  boundaries,  either 
of  the  low,  the  middle,  or  the  high  distribution.  For  in¬ 
stance,  if  the  boundary  between  the  low  and  the  middle 
distribution  is  .30  and  the  mean  of  the  cell  is  .24,  then 
the  cell  mean  at  this  point  falls  in  the  range  of  the  low 
distribution.  Therefore,  in  order  to  best  approximate 
this  cell  mean,  we  draw  randomly  from  the  probabilities 
of  the  low  distrioution  and  assign  the  resulting  probabil¬ 
ity  to  this  cell.  At  the  same  time,  we  eliminate  this 
probability  from  the  list  of  low  probabilities.  This  low 
probability  may,  in  fact,  take  on  any  value  between  0.0 


and  1.0;  however,  in  general  it  will  have  a  value  near  the 
average  value  for  the  low  distribution. 

At  this  point  there  remains  one  less  probability  to 
be  assigned  to  this  cell.  The  required  average  of  these 
remaining  probabilities  is  not,  however,  equal  to  the  cell 
mean.  If  the  first  probability  assigned  happened,  by 
chance,  to  be  lower  them  the  cell  mean,  then  the  average 
of  the  remaining  probabilities  must  be  higher  than  the 
cell  mean  in  order  that  the  average  probability  for  the 
cell  be  equal  to  the  cell  mean.  On  the  other  hand,  if 
the  assigned  probability  happened  to  be  larger  than  the 
cell  mean,  the  average  of  the  remaining  probabilities 
must  be  smaller  than  the  cell  mean.  Therefore,  we  cal¬ 
culate  this  new  required  average  probability  for  the  yet- 
to-be  assigned  probabilities  and  again  draw  randomly  from 
the  appropriate  distribution  implied  by  this  new  average, 
deleting  the  new  probability  from  the  list  after  it  has 
been  assigned  to  the  cell.  This  process  is  repeated 
probability-by-probability  and  cell-by-cell.  If  a  list  is 
exhausted,  the  probabilities  are  drawn  from  the  distribu¬ 
tion  which  is  closest  to  the  mean  of  the  probabilities  to 
be  drawn.  Eventually,  all  the  probabilities  are  assigned 
in  this  way  to  all  of  the  cells.  This  method  of  r andean 
assignment  does  not  exactly  reproduce  the  cell  means  as 
required  from  the  first  link  of  the  simulation;  however, 
we  recal  ?.  that  the  cell  means  produced  by  the  parameter 
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estimation  process  are  themselves  only  estimates  of  the 

true  values.  Moreover,  we  are  not  likely  to  analyze  the 

simulation  output  for  a  single  cell  (unless  it  is  a  very 

large  cell),  but  for  groups  of  cells,  and  the  larger  the 

number  of  probabilities  in  the  cell  or  groups  of  cells, 

the  more  likely  is  the  assignment  process  to  reproduce 

closely  tne  mean  probability  required  from  the  first  link 

2 

of  the  simulation. 

Thus,  upon  completion  of  the  second  link  of  the  simu¬ 
lation,  we  have,  for  each  vehicle,  probabilities  of  expo¬ 
sure  assigned  to  each  cell  of  the  population  in  such  a 
way  as  to  closely  reproduce  the  cell  mean  audience  for 
the  vehicle.  These  probabilities,  however,  have  not  yet 
been  assigned  to  specific  individuals  within  the  population 
cells.  In  making  this  assignment  we  must  consider  the 
audience  duplication  between  vehicles. 


2 

While  writing  this  description,  an  improvement  to 
this  assignment  process  becomes  evidient.  In  addition  to 
recalculating  after  each  draw  the  average  required  for 
the  remaining  probabilities  one  could  also  recalculate 
the  average  for  each  distribution  after  the  probability 
drawn  has  been  deleted.  Then  the  new  distribution  means 
and  cell  mean  would  govern  the  choice  of  the  distribution 
from  which  the  next  probability  is  to  be  drawn.  Also, 
the  boundaries  between  the  distributions  could  be  defined 
by  some  sort  of  weighted  average  of  the  mean  distribution 
probabilities.  Finally,  we  note  that  the  cells  are  pro¬ 
cessed  in  their  logical  order.  Perhaps  it  would  be  better 
to  process  them  according  to  size,  from  largest  to  small¬ 
est,  since  the  smaller  cells  are  not  usually  as  important 
and  since  the  few  draws  required  to  fill  them  will  not 
afford  much  opportunity  for  recalculation  of  means  and 
correction  anyway. 


206 


Duplication  in  the  Vehicle  Audiences 

As  we  have  mentioned  before,  there  are  two  important 
ways  in  which  we  specify  structure  in  the  mass  media  simu¬ 
lation.  The  first  of  these  is  in  the  organization  of  the 
probabilities  of  exposure  for  a  particular  vehicle.  The 
differences  in  these  probabilities  result  in  some  people 
being  almost  constantly  exposed  to  the  vehicle,  while 
other  people  are  almost  never  exposed  to  the  vehicle. 

These  two  extremes  combine  with  people  who  cure  sometimes 
exposed  to  the  vehicle  to  produce  a  net  audience  which  is 
just  equal  to  the  mean  audience  figures  which  we  have  ar¬ 
rived  at  from  our  survey  data  and  other  considerations. 

The  second  way  in  which  structure  is  programmed  into 
the  simulation  is  through  the  duplication  among  the  vehicle 
audiences.  Time  and  time  again,  audience  studies  have 
shown  that  those  people  who  have  a  high  probability  of 
exposure  to  one  vehicle  generally  have  a  high  probability 
of  exposure  to  other  vehicles;  that  in  fact  there  exists 
a  syndrome  which  we  might  label  the  high  media-consumption 
syndrome.  Conversely,  there  also  exists  a  low  media- 
consumption  syndrome;  that  is,  there  are  individuals  who 
have  very  low  probabilities  of  exposure  to  any  and  all 
of  the  vehicles  in  the  mass  media.  Therefore,  just  as 
there  are  individuals  who  almost  always  see  every  issue 
or  broadcast  of  a  given  vehicle,  there  cure  also  individuals 


and  to  a  large  extent,  the  same  individuals — who  are  high¬ 
ly  exposed  to  a  wide  range  of  vehicles.  Even  among  those 
individuals  who  are  exposed  to  a  wide  range  of  vehicles, 
however,  we  often  find  relatively  low  duplication  between 
two  vehicles  which  serve  more  or  less  equivalent  functions, 
e.g.,  Newsweek  and  Time  magazines.  Since  the  contents  of 
these  magazines  tend  to  be  quite  similar,  a  reader  of  one 
igazine  is  not  likely  to  be  a  reader  of  the  other,  even 
chough  the  demographic  characteristic  of  the  two  audiences 
may  be  quite  similar. 

The  simulation  accounts  for  this  non-random  duplica¬ 
tion  between  vehicle  audiences  in  two  ways.  The  first 
involves  the  cell  mean  probabilities  of  exposure  for  each 
population  type  for  each  vehicle.  The  cell  means  cause 
non-random  duplication  in  the  following  manner:  assume 
that  certain  people  are  more  likely  to  be  exposed  to 
evening  news  broadcasts  than  are  other  people  and  that 
these  are  people  of  higher  education.  If  these  people 
are  also  more  likely  to  be  exposed  to  front-page  newspaper 
news,  then  the  cell  means  for  people  of  higher  education 
will  be  higher,  both  for  evening  news  broadcasts  and  for 
newspaper  readership.  The  product  of  these  two  probabil¬ 
ities,  which  in  general  will  be  large  because  each  of  the 
two  probabilities  is  itself  high,  gives  the  expected 
audience  duplication  if  the  assignment  of  probabilities 
for  the  two  vehicles  reproduces  the  cell  means  and 
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otherwise  is  done  randomly  across  members  of  that  particu¬ 
lar  cell.  Thus,  the  demographic  d_stribution  of  the  audi¬ 
ences  of  the  several  vehicles  does  itself  account  for  much 
of  the  non-random  duplication  of  audiences  among  those 
vehicles.  In  fact,  in  most  audience  studies,  the  factors 
which  are  used  to  explain  the  phenomenon  of  wide  media  ex¬ 
posure  among  a  given  group  of  individuals  are  generally  the 
demographic  and  sociological  c  aracteristics  of  that 
group.  For  the  case  of  non-random  duplication  which  is 
not  accounted  for  by  the  cell  means,  we  must  make  a  non- 
random  assignment  of  probabilities  to  individuals  within 
cells. 

Within-Cell  Non-Random  Probability  Assignments 

The  third  and  fourth  links  of  the  simulation  perform 
the  calculations  which  enable  the  simulation  to  take  some 
account  of  duplication  which  is  not  accounted  for  by  the 
demographic  characteristics  as  implied  in  the  cell  means. 
For  those  vehicles  for  which  the  duplicated  audience  is 
known  or  assumed,  that  audience  is  entered  into  the  simu¬ 
lation,  and  a  matrix  of  the  duplicated  audiences  between 
every  possible  pair  of  vehicles  is  produced.  From  the 
cell  means  produced  by  the  first  link  of  the  simulation, 
we  also  generate  a  corresponding  matrix  of  expected  dup¬ 
lications,  giving  random  wi thin-cell  assignments  of  prob¬ 
abilities.  Corresponding  elements  of  the  two  matrices 
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of  random  arid  real-world  duplications  among  vehicles  are 
then  compared/  and  if  a  difference  of  more  than  five  per¬ 
cent  of  the  real-world  figure  is  found,  the  square  of 
this  difference  is  entered  in  a  third  matrix  of  squared 
differences  of  duplications.  For  those  vehicles  about 
which  there  is  no  a  priori  assumption  about  duplication 
or  real-world  duplication  information,  the  value  entered 
into  the  third  matrix  is  0.0.  We  have  produced,  then,  a 
matrix  of  the  squared  differences  between  the  known  or 
assumed  empirical  duplication  and  the  random  (at  the  cell 
level)  duplication  implied  by  the  simulation. 

Any  correction  we  make  now  for  the  duplication  should 
in  theory,  be  made  simultaneously  for  all  vehicles  having 
significant  non-random  duplication,  i.e.,  it  is  not  suf¬ 
ficient  to  correct  the  duplication  between  vehicle  A  and 
vehicle  B,  and  then  between  vehicle  A  and  vehicle  C, 
since  duplications  between  A  and  B,  A  and  C,  and  B  and  C, 
all  may  require  correction.  We  have  not  been  able  to 
discover  an  algorithm  which  would  allow  the  correction  of 
duplication  for  all  the  possible  cases  simultaneously. 

We  use  instead  a  correction  which  will  consider  up  to 

five  vehicles  at  a  time.  The  algorithm,  suggested  by 

3 

Robert  P.  Abelson,  groups  the  vehicles  into  overlapping 
sets  of  five  such  that  each  set  is  closely  bound  together 

3 

Robert  P.  Abelson,  Department  of  Psychology,  Yale 
University,  July  25,  1964,  personal  letter. 
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in  the  sense  that  the  duplications  among  the  five  vehicles 
are  quite  non-random.  It  chooses  ao  the  first  four  ve¬ 
hicles  in  the  first  set,  those  four  for  which  the  sums  of 
the  squared  differences  between  the  real  world  and  expec¬ 
ted  (based  on  audience -derived  cell  means)  duplication 
over  all  vehicles  in  the  matrix,  are  largest.  The  fifth 
vehicle  in  the  set  is  that  vehicle  for  which  the  sum  over 
the  first  four  vehicles  is  largest,  i.e.,  the  vehicle  with 
the  largest  interaction  with  the  first  four.  To  form  the 
next  group  of  five,  the  algorithm  finds  which  of  the  re¬ 
maining  vehicles  has  the  largest  interaction  with  the  first 
group.  It  then  removes  from  the  group  the  vehicle  with 
the  smallest,  interaction  with  the  newly  identified  vehicle 
and  adds  the  latter,  forming  a  new  group  of  five  vehicles. 
Thus,  we  have  a  second  set  of  five  vehicles,  four  of 
which  were  included  in  the  first  set  of  five  vehicles. 

We  proceed  in  this  way  until  all  the  vehicles  for  which 
there  is  duplication  information  are  included  in  a  set  of 

4 

linked  vehicles.  ' 

The  fourth  link  of  the  simulation  continues  calcula¬ 
tion  for  the  assignment  of  probabilities  in  order  to  re¬ 
produce  non-random  duplication.  At  first  the  vehicles 
are  treated  in  pairs,  each  pair  consisting  of  two  of  the 

4 

This  algorithm  is  discussed  in  detail  by  Herbert  J. 
Selesnick  in  Link:  A  Program  that  Groups  Media  Among 
Which  There  Exists  a  Non-Random  Duplication,  COMCOM/ 
Simulation  Memo  #17,  August  26,  1964. 
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vehicles  included  in  one  of  the  sets  of  five  linked  ve¬ 
hicles.  Again  we  treat  the  problem  in  the  expected  value 
sense  and  ask:  How  shall  we  make  the  assignment  of  prob¬ 
abilities  for  the  two  vehicles  so  as  to  reproduce  the 
known  duplication?  For  each  member  of  the  population, 
the  pair  of  probabilities  for  the  two  vehicles  can  be 
categorized  by  its  distribution  (high,  middle,  low)  on 
vehicle  A  and  vehicle  B:  therefore,  there  sure  nine  pos¬ 
sible  categories  of  pairs  ranging  from  low  distribution 
probabilities  for  both  vehicle  A  and  vehicle  B.  Figure 
VI-1  below  indicates  the  nine  possibilities. 

Distribution  on  Vehicle  A 


Distribution 

on 

Vehicle  B 


High 

Middle 

Low 

High 

X11 

X12 

X13 

Middle 

X21 

X22 

X23 

Low 

i 

X31 

X32 

X33 

Figure  VI-1.  Logical  Distribution  Types  for  Pairs  of 
Probabilities . 

If  v/e  let  x^j  represent  the  number  of  people  (or  assign¬ 
ment  pairs)  in  the  ij**  cell,  let  PL^,  PMA,  PHA 
represent  the  average  probability  of  the  low,  middle  and 
high  distributions  of  vehicle  A,  and  PLfi,  PMg,  ?Hfi  repre¬ 
sent  the  corresponding  averages  for  vehicle  B,  then  the 
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expected  audience  duplication  is  given  by  the  equation: 
Expected  Duplication  =  xh*pha*phb  +  X12*PMA*PHB 

+  X13*PLA*PHB  +  X21*PHA*PMB 

+  X22*PMA*PMB  +  X23‘PLA*Pi4B 
+  X31*PHa*PLb  +  x32*PMa*PLb 

+  X33  *PLA*PLB 

In  addition,  the  total  number  in  each  distribution  .r 
each  vehicle  is  known,  giving  five  independent  equations 
of  the  form 

Number  in  High  Distribution  for  Vehicle  A  = 

+  x„,  +  x. 


21 


31* 


Any  solution  of  these  six  equations  involving  the 
number  of  probability  pairs  wi.ll  produce  the  desired  cum¬ 
ulation  between  two  vehicles.  It  is  possible  that  there 
is  no  solution  to  these  equations,  i.e.,  that  the  audi¬ 
ence  duplication  required  empirically  is  either  too  large 
or  too  small  to  be  reproduced  in  this  expected  value 
sense,  given  the  distribution  means  and  sizes  for  the  L'.*o 
vehicles.  However,  this  would  be  unlikely  if  the  means 
of  the  low  and  high  distributions  were  quite  different 
for  both  of  the  vehicles.  In  any  event,  a  calculation  is 
made  to  determine  the  maximum  and  minimum  values  of  the 
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duplication  which  can  be  reproduced  by  the  distributions. 

The  calculation  is  made  according  to  the  following  theorem: 

Theorem:  Given  that  we  have  two  sets  of  N  positive 
numbers.  Consider  the  sum  of  the  N  products  formed 
by  multiplying  a  number  from  one  set  by  a  number  from 
the  other  set.  (Each  number  is  included  in  only  one 
product.)  Then  the  largest  value  of  the  sum  is 
a  hieved  in  the  following  way:  multiply  the  highest 
number  in  the  first  distribution  by" ’the  highest  num¬ 
ber  in  the  second  distribution,  the  second  highest 
number  in  the  first  distribution  by  the  second  high¬ 
est  number  in  the  second  distribution,  etc.  The 
lowest  value  for  the  summed  products  is  formed  by 
multiplying  the  highest  number  in  the  first  distribu¬ 
te  on  by  the  lowest  number  in  the  second  distribution, 
the  second  highest  number  in  the  first  distribution 
by  the  next-to- lowest  number  in  the  second  distribu¬ 
tion,  etc. 

Using  this  theorem,  the  calculation  of  the  largest 
and  smallest  values  of  the  duplication  which  can  possibly 
be  reproduced  is  straightforward,  given  the  means  and 
sizes  of  the  three  distributions  of  each  of  the  two  ve¬ 
hicles.  If  we  find  that  the  actual  duplication  falls 
outside  this  range,  the  values  for  the  number  of  people 
in  the  ninefold  categories  are  set  at  those  values  which 
come  closest  to  reproducing  the  empirical  duplication, 
e.g.,  if  the  empirical  duplication  is  too  high  to  be  re¬ 
produced,  the  values  of  the  are  set  (according  to  the 
theorem  above)  to  produce  the  maximum  possible  duplica¬ 
tion.  If  the  empirical  duplication  falls  within  the 
range  of  possible  duplications,  then  a  linear  programming 
algorithm  is  simply  used  to  find  a  feasible  solution  to 
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the  set  of  five  equations  and  nine  unknowns.  Thus,  for 
each  pair  of  linked  vehicles,  a  table  of  the  nine  is 
produced . 

Just  as  there  are  nine  probability-pair  types  for 

each  pair  of  vehicles,  there  are  for  each  group  of  five 

5 

vehicles  (each  vehicle  having  three  distributions)  3  or 
243  possible  probability-quintuple  types  based 

on  the  probability  distribution  for  each  vehicle.  If  we 


can  calculate  values  for  the  243  X. ,  which  are  consis- 

ijklm 

tent  with  the  nine  x. .  for  each  of  the  ten  pairs  of  ve- 

13  c - 

hides  in  a  set  of  five  vehicles  (given  that  the  duplica¬ 


tion  information  was  there  for  all  pairs  of  the  vehicles)  , 


then  an  assignment  of  probabilities  according  to  the 


243  would,  on  the  average,  exactly  account  for  all 

the  pairwise  duplication  in  the  set  of  five  vehicles.  By 
now  it  should  be  clear  how  we  go  from  a  set  of  sobtables 
to  the  overall  grand  table;  we  use  the  smoothing  iteration 
to  estimate  the  frequencies  in  the  five-dimensional  table 
from  the  ten  possible  two-dimensional  tables.  It  some¬ 
times  happ  .ns ,  in  per f oiming  this  iteration ,  that  the  ten 
two-dimensional  tables  have  inconsistencies.  These  in¬ 
consistencies  do  not  necessarily  arise  because  the  data 
is  inconsistent,  but  because  the  particular  feasible 
solutions  arrived  at  by  the  linear  programming  algorithm 
are  inconsistent.  At  any  rate,  it  is  sometimes  necessary 
to  omit  one  or  more  of  the  two-dimensional  tables  in 


order  that  the  iteration  converges.  Then  the  output  from 
this  fourth  link  of  the  simulation  is  the  proportion  of 
the  probability-quintuple  assignments  (or  of  the  popula¬ 
tion)  which  falls  into  each  of  the  243  types,  for  each 
set  of  five  vehicles.  This  information,  in  the  form  of 
percentages,  is  then  passed  on  to  the  fifth  link  of  the 
simulation  which  makes  the  actual  assignments  of  probabil¬ 
ities  to  individuals. 

The  fifth  link  of  the  simulation  does  the  actual  as¬ 
signment  of  probabilities  to  individuals  for  all  the  ve¬ 
hicles  in  the  media  system.  The  probabilities  for  those 
vehicles  which  are  not  involved  in  any  way  with  other  ve¬ 
hicles,  i.e.,  those  vehicles  which  do  not  appear  in  any 
of  the  linked  groups  either  because  they  have  random  dup¬ 
lication  with  other  vehicles  or  because  there  are  no  data 
about  the  duplication,  are  assigned  randomly  to  individ¬ 
uals  within  each  cell.  Also,  the  first  probabilities  for 
the  first  vehicle  of  the  first  group  of  five  linked  ve¬ 
hicles  are  assigned  randomly.  For  the  second  vehicle,  we 
proceed  in  the  following  fashion:  the  243  percentages 
are  summed  over  the  three  vehicles  in  which  we  are  not 
yet  concerned  to  get  the  nine  percentages,  (just  the  nine 
for  the  first  two  vehicles  of  the  group,  expressed  as 
percentages) ,  which  describe  the  required  probability- 
pair  types  for  the  first  and  second  vehicles.  These  x^ 
express  the  relationship  between  the  two  vehicles 
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required  in  the  entire  population  to  reproduce  the  empi¬ 
rical  duplication.  In  attempting  to  reproduce  this  dup¬ 
lication  for  the  cell,  however,  we  are  constrained  by  the 
limited  number  of  people  in  any  cell.  Often,  we  may  have 
even  fewer  people  in  the  cell  than  we  have  probability 
assignment  types,  especially  when  the  number  of  types  is 
243.  In  this  case,  we  obviously  cannot  reproduce  the  re¬ 
quired  assignments ;  we  can  only  approximate  it. 

Also,  the  probabilities  for  each  of  the  vehicles 
have  already  been  allocated  to  the  cell.  Fcr  any  vehicle, 
the  probability  distribution  at  the  cell  level  will  not, 
in  general,  be  identical  with  the  distribution  for  the 
entire  population  which  was,  of  course,  used  An  generat¬ 
ing  Ahe  243  We  attempt,  at  the  cell  level,  to 

approximate  the  percentages  of  the  243  types  by  performing 
the  smoothing  iteration  again.  The  subtables  used  in 
this  process  for  the  case  of  the  first  and  second  vehicles 
in  the  first  set  sure  simply  the  numbers  of  probabilities 
in  the  cell  from  each  distribution  for  each  of  the  two 
vehicles.  Thus,  we  have  two  three-value  tables  for  each 
vehicle  which  must  be  put  together  as  closely  as  possible 
to  resemble  the  ninefold  table  of  combinations  implicit 
in  the  243  percentages.  The  matrix  of  nine  combinations 
is  initialized  with  the  exact  values  implicit  in  the  243 
percentages  and  then  the  iteration  is  performed  to  make 
the  matrix  conform  to  the  two  three-value  marginals.  The 
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iteration  usually  produces  non-integral  values  for  the 
nine  types  which  must  be  made  integral  (since  one  can 
assign  only  integral  numbers  of  probabilities)  and  yet 
conform  to  the  marginals. 

First,  we  produce  integral  values  by  a  Monte  Carlo 
rounding  of  each  of  the  fractions.  Then,  we  check  each 
of  the  rows  and  columns  against  the  marginal  sums  and, 
if  errors  are  found,  we  add  or  subtract  from  the  column 
where  the  error  occurs  and  correct  the  corresponding  row 
value.  In  this  w.iy,  we  reproduce  as  closely  as  possible 
(given  the  constraints  of  the  probabilities  already  drawn 
for  the  individual  cell)  the  assignments  required  over 
the  entire  population  in  order  to  reproduce  the  empirical 
duplication.  This  process  is  extended  in  an  obvious  way 
for  duplication  between  the  first  and  second  vehicle  as 
a  pair  and  a  third  vehicle  in  the  group  of  five,  etc. 

The  assignment  process  must  at  many  points  be  sac¬ 
rificed  to  the  more  important  preservation  of  cell  means 
by  maintaining  the  probabilities  for  each  vehicle  within 
the  cells.  We  also  have  not  attempted,  at  any  point,  to 
look  at  individual  probabilities,  but  rather  have  treated 
distributions  of  probabilities  in  terms  of  their  known 
averages.  Thus,  the  assignment  process  cannot  be  expected 
to  fully  achieve  the  required  empirical  duplication;  how¬ 
ever,  it  does  make  an  attempt  to  bias  the  assignment  of 
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probabilities  in  the  direction  required  to  more  nearly 
approximate  the  pairwise  real-world  vehicle  duplication. 

Estimating  the  Empirical  Duplications 

We  do  not  know,  in  general,  how  much  of  the  duplica¬ 
tion  is  accounted  for  by  the  demographic  factors  and  how 
much  duplication  remains  due  to  other  factors  which  have 
not  been  specified  as  dimensions  of  the  simulation  popu¬ 
lation.  However,  the  simulation  itself  may  help  to  get 
a  firmer  grasp  of  the  magnitude  of  the  effects  of  the 
non-cell-definincfr  but  audience  influencing,  characteris¬ 
tics,  since  given  a  known  duplication  it  will  examine 
the  duplication  which  would  be  produced  by  random  within- 
cell  assignment,  and  measuring  the  difference  between  the 
random  assignment-produced  duplication  and  the  empirical 
duplication,  it  will  decide  whether  or  not  a  non-random 
assignment  is  required.  Therefore,  the  simulation  ac¬ 
tually  makes  a  decision,  where  the  empirical  duplication 
data  cure  available,  about  the  strength  of  the  effects  of 
the  non-cell-defining  variables.  In  the  next  section,  we 
estimate  duplication  of  audiences  for  those  vehicles  for 
which  we  can  make  reasonable  estimates  of  that  duplica¬ 
tion.  For  example,  the  duplication  among  the  three  daily 
newspapers  was  measured  to  some  degree  by  the  NORC  sur¬ 
vey  which  was  used  to  generate  the  original  audience  es¬ 
timates  for  those  vehicles.  On  the  other  hand,  for  the 
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radio  audiences  we  have  no  direct,  empirical  measurement 
of  the  audience  duplication.  There  do  exist  some  Hooper 
and  Nielsen  data  which  give  indications  of  likely  audience 
duplication  for  consecutive  programs  or,  for  example,  for 
programs  in  the  same  time  spot  on  weekday  evenings.  In 
these  cases,  we  will  attempt  estimates  of  the  audience 
duplication.  It  remains  to  be  seen  whether  these  dupli¬ 
cation  estimates  differ  substantially  enough  from  the 
duplication  implied  by  the  cell  means  to  require  the  simu¬ 
lation  to  actually  make  non-random  probability  assign¬ 
ments  within  cells.  As  for  the  duplication  of  audiences 
between  the  newspapers  and  the  various  radio  broadcasts, 
there  seems  to  be  very  little  data  other  than  the  general 
knowledge  that  people  who  are  highly  exposed  to  one  ve¬ 
hicle  and  in  particular  specific  kinds  of  news  in  one 
vehicle,  are  quite  likely  to  be  exposed  to  other  vehicles, 
and  that  same  kind  of  news  in  other  vehicles.  However, 
as  was  stated  above,  there  is  little  reason  to  believe 
that  these  duplication  effects  are  not  rather  well  ex¬ 
plained  by  the  population  dimensions,  i.e.,  the  demo¬ 
graphic  and  sociological  characteristics  of  the  audience, 
rather  than  by  factors  which  have  not  been  considered. 
Therefore,  in  these  cases,  we  will  not  attempt  to  derive 
an  audience  duplication.  Let  us  proceed  now  to  the  audi¬ 
ence  duplication  among  the  three  weekday  newspapers. 
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Audience  Duplication  Among  the  Three  Daily  Newspapers 

It  will  be  recalled  from  Chapter  III  (p.  )  that 

the  answers  to  the  NORC  question  concerning  readership 
of  the  daily  newspapers  are  to  be  interpreted  as  follows: 
a  "no"  answer  for  any  paper  is  considered  to  be  exactly 
that,  i.e.,  no  readership.  However,  a  "yes"  answer  is 
taken  to  mean  a  probability  of  0.8855  of  exposure  to 
that  newspaper  on  that  given  day.  From  this  and  the 
duplication  and  triplication  as  measured  by  the  NORC 
survey,  we  are  able  to  arrive  at  the  duplication  between 
any  two  of  the  daily  newspapers.  For  example,  of  those 
people  who  claim  exposure  to  both  the  Post  and  Times -Star , 
we  consider  that  a  proportion  equal  to  the  square  of  the 
probability  of  exposure  were  actually  exposed  to  both 
newspapers  on  any  two  random  occasions.  There  is  also, 
however,  another  group  of  people  who  may  have  been  ex¬ 
posed  to  both  newspapers ,  namely  those  people  who  claim 
exposure  to  all  three  newspapers  but  who,  according  to 
our  probability  measure,  were  exposed  to  only  two  of  the 
three.  This  is  the  number  claiming  exposure  to  three 
newspapers  multiplied  by  the  square  of  the  probability 
and  by  the  factor  1  minus  the  probability.  By  this 
method,  we  have  estimated  the  probability  of  exposure  to 
any  two  newspapers  (see  Table  VI-1) . 

We  note  that  the  corrected  duplications  are  nearly 
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Table  VI-1.  NORC  and  Corrected  Empirical  Audience  Dupli¬ 
cation  Between  Pairs  of  Daily  Newspapers  as 
Percentages  of  the  Total  Population. 


Duplication 

Newspaper  Pair 

Enquirer- 

Post 

Enquirer- 

Times-Star 

Post- 

Times-Star 

Raw  NORC 

9.62% 

11.91% 

5.04% 

Corrected  by 
the  Probability 
Factor  .8855 

9.52 

10.60 

5.22 

equal  to  the  raw  duplications.  Although  the  raw  duplica¬ 
tions  were  somewhat  diminished  (by  a  factor  of  about  78 
percent)  by  our  correction,  it  was  also  increased  by  the 
approximately  9  percent  of  those  who  claimed  reading  of 
all  three  newspapers,  but  who  were  assigned  to  the  cate¬ 
gory  of  readers  of  only  two  of  the  newspapers.  Thus,  we 
see  that  in  fact  the  duplication  for  the  Post  and  Times- 
Star  has  been  slightly  increased  from  the  raw  duplication. 
In  passing,  we  note  also  that  the  smallest  duplication  is 
between  the  Post  and  Times-Star .  This  is  to  be  expected 
since  the  Post  and  Times-Star  are  both  evening  newspapers 
and  the  Enquirer  is  a  morning  newspaper. 

For  the  case  of  the  Sunday  Enquirer,  we  shall  make 
no  estimate  of  the  duplication  between  it  and  any  other 
vehicles  since  there  are  no  data  available.  It  would 
seem  reasonable  to  estimate  that  the  duplication  between 
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the  Sunday  Enquirer  and  the  daily  Enquirer  would  be  very 
high,  but  this  is  quite  likely  accounted  for  by  the  dupli¬ 
cation  as  produced  by  the  cell  means,  since  the  Sunday 
Enquirer  audience  was  projected  cell  by  cell  from  the 
audience  of  the  daily  Enquirer.  In  those  cells  in  which 
the  daily  Enquirer  has  a  high  audience,  the  Sunday  En¬ 
quirer  will  also  have  a  high  audience  and  a  relatively 
high  duplication  is  therefore  assured  with  a  random 
wi thin-cell  probability  assignment.  For  the  case  of  the 
other  media,  the  random  assignment  of  Sunday  Enquirer 
probabilities  of  exposure  will  result  in  a  rather  high 
duplication  because  of  the  fact  that  the  Sunday  Enquirer 
reaches,  by  our  best  estimate,  about  65  per  cent  of  the 
adult  population  on  an  average  issue.  This  seems  already 
to  insure  a  high  duplication  and  therefore,  no  other  ad¬ 
justments  will  be  made. 

Duplication  Among  the  Radio  Audiences 

The  data  relating  to  audience  duplication  for  radio 
is  very  poor.  One  of  the  sources  of  data  is  derived  from 
the  Nielsen  Radio  Index  as  cited  in  Sandage.^  These  data 
indicate  the  proportion  of  the  total  audience  of  a  pro¬ 
gram  which  listens  throughout  95  per  cent  of  the  program. 
For  the  evening,  this  proportion  ranges  from  29  per  cent 
for  a  band  concert  to  76  per  cent  for  a  leading  variety 

^C.  H.  Sandage,  Radio  Advertising,  p.  147. 


show.  For  the  daytime,  the  percentages  range  from  50  per 
cent  for  a  low-rated  serial  to  78  per  cent  for  a  high¬ 
rated  serial.  For  two  daytime  news  programs,  54  and  69 
per  cent  were  rather  constant  continuing  listeners  to  the 
programs.  One  other  piece  of  data  cited  by  Nxelsen  in¬ 
dicates  that  the  duplicated  audience  of  two  programs  on 
different  networks  for  the  same  evening  was  approximately 
30  per  cent  of  the  audience  of  the  smaller  program.  Also 
for  two  consecutive  evening  programs,  the  audience  dupli¬ 
cation  was  71  per  cent  of  the  audience  of  the  smaller 
6 

programs . 

These  data,  however,  are  not  exactly  what  we  need 
since  the  percentages  indicate  the  proportion  of  people 
who  stay  with  one  program  or  with  both  of  two  programs; 
in  the  case  of  our  radio  news  broadcast  vehicles  over  all 
of  the  five  radio  stations,  we  would  like  to  know  the 
number  of  people  who,  given  that  they  listen  at  one  point 
in  time,  are  also  listening  one  hour  later.  These  people 
generally  will  be  exposed  then  to  news  separated,  on  the 
average,  by  a  one  hour  time  interval.  We  do  not  wish  to 
take  account  of  switching  from  station  to  station  since 
in  general  the  news  broadcasts  occur  on  the  hour  over 
every  station.  We  want  the  constancy  of  listenership 
over  hours,  not  over  stations.  Therefore,  the  percentages 

^A.  C.  Nielsen,  New  Facts,  p.  55. 
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quoted  from  Sandage  underrate  the  constancy  of  listening 
to  the  kinds  of  broadcasts,  i.e.,  generic  radio  news¬ 
casts,  about  which  we  are  concerred. 

Since  the  data  are  so  poor,  we  make  an  arbitrary 
decision  as  to  how  much  duplication  exists  between  audi¬ 
ences  of  two  consecutive  programs.  We  estimate  that  the 
duplication  between  any  two  programs  is  50  per  cent  of 
the  smaller  of  the  two  audiences.  Using  the  smaller  of 
the  two  audiences  as  the  base  takes  account  of  the  in¬ 
flux  into  the  audiences  of  people  arising  in  the  morning 
and  also  the  influx  of  people  (men)  returning  from  work 
during  the  hours  from  five  to  seven.  The  rationaliza¬ 
tion  is  that  those  who  are  in  the  audience  at  7:00  A.M. 
cure  rather  likely  to  be  in  the  audience  at  8:00  A.M.  and 
those  who  are  in  the  audience  at  5:00  P.M.  are  rather 
likely  to  be  in  the  audience  at  6:00  P.M.  This  somewhat 
arbitrary  formula  defines  the  audience  duplication  for 
the  radio  audiences.  As  we  noted  above,  we  will  not 
attempt  any  estimate  of  audience  duplication  between 
radio  and  newspaper  audiences  and  we  note  that  our  only 
estimate  of  audience  duplication  for  radio  occurs  be¬ 
tween  consecutive  radio  broadcasts. 

Let  us  also  note  here  the  implications  for  dupli¬ 
cation  of  our  treating  of  weekday  radio  broadcasts  as 
one  group  of  broadcasts  which  apply  to  any  weekday. 

This  means,  for  example,  that  we  do  not  actually  have 
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five  assignments  of  probabilities  for  the  8:00  A.M.  news 
Monday  to  Friday,  but  that  one  assignment  of  probabil¬ 
ities  holds  for  each  of  three  days.  Since  the  process 
is  a  probability  process,  this  does  not  mean  that  a  per¬ 
son  who  is  exposed  on  one  weekday  will  necessarily  be 
exposed  on  another  weekday,  (given  the  proper  messages) 
unless  the  probabilities  are  identically  1.0,  which  is 
a  very  rare  case.  However,  the  distribution  of  exposures 
within  the  cell  will  not  be  changing  from  weekday  to 
weekday,  but  will  remain  rather  constant.  We  believe 
that  this  adds  a  considerable  amount  of  structure  to  the 
simulation.  Since  peoples'  weekday  habits  tend  to  be 
rather  constant,  this  added  structure  appears  plausible. 

The  audience  duplications  between  pairs  of  consecu¬ 
tive  radio  broadcasts  are  given  in  Table  VI-2. 

Upon  completion  of  the  fifth  link  of  the  simulation, 
the  media  system  has  been  couple tely  defined.  We  may 
conceive  of  the  media  system  in  the  following  schematic 
way.  We  form  a  matrix  in  which  the  entries  in  the  left 
hand  side  are  the  identification  numbers  for  each  of  the 
hypothetical  members  of  the  population.  In  the  first 
columns  of  the  matrix  are  entered  for  each  member  of  the 
population  the  ceil  number  or  equivalently,  the  category 
or  level  for  each  of  the  population  dimensions.  In  the 
next  columns  are  entered,  for  each  vehicle  included  in 
the  media  system,  the  individual's  probability  of 
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exposure  to  that  vehicle  and  the  distribution  from  which 
this  probability  was  drawn.  The  schematic  diagram  is 
shown  in  Figure  VI-2  below. 


I.D.  Nos. 
of  the 
Population 


3000  people 


Various  attri¬ 
butes  of  each 
member  of  the 
population 


Probabilities  of 
exposure  to  each 
vehicle  and  for 
each  vehicle  the 
distribution  in 
which  the  person 


Figure  vI-2.  Schematic  Diagram  of  the  Media  System  and 
Population  Upon  Completion  of  the  First 
Five  Links  of  the  Simulation 
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CHAPTER  VII 


THE  CONTENT  ANALYSIS  AND  PREPARATION  OF  MESSAGE  SCENARIOS 


The  final  two  links  of  the  simulation  irLroduce 
the  actual  scenario  of  messages  ordered  by  themes  and  time 
periods  and  cycle  through  the  population,  testing  to  see 
whether  each  individual  is  exposed  to  the  messages  and  re¬ 
porting  for  each  theme,  time  period,  and  media  type,  such 
statistics  as  the  number  of  exposures  and  cumulative  number 
of  individuals  exposed  by  media  type. 

The  first  step  in  exposing  the  members  of  the  hypo¬ 
thetical  population  to  messages  appearing  in  the  media  sys¬ 
tem  is  the  identification  of  kinds  of  messages  of  interest 
to  the  researcher  followed  by  a  content  analysis  of  the 
media  system,  in  order  to  locate  the  particular  messages 
in  the  vehicle  at  a  particular  point  of  time  and  with  a 
given  format.  Therefore,  we  turn  now  to  the  content  analy¬ 
sis  of  the  Cincinnati  mass-media  system  during  the  six- 
month  interval  from  September,  1947,  to  March,  1948. 
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Themes  Coded  for  Content  Analysis 


In  the  literature  reporting  on  the  Cincinnati 
campaign,  the  major  theme  has  to  do  with  information 
about  the  United  Nations.  This  is  only  natural,  since 
the  funding  for  the  campaign  and  the  purpose  of  the 
campaign  were  explicitly  to  change  levels  of  informa¬ 
tion  and  opinions  about  the  United  Nations.  For  this 
reason,  we  initially  began  the  study  and  simulation 
with  the  hope  of  using  the  panel  data  on  attitudes 
about  the  United  Nations  as  the  major  validation  for 
the  simulation.  However,  when  we  look  at  the  results 
of  the  surveys,  we  find  that,  in  fact,  there  was  very 
little  change  in  information  or  attitudes  about  the 
United  Nations  in  the  six  month  period.  Ft>r  example, 
the  proportion  of  the  population  unfamiliar  with  the 
main  purpose  of  the  United  Nations  organization  de¬ 
creased  from  30  percent  in  September,  1947,  to  28  percent 
in  March,  1948.  The  proportion  of  people  who  had  heard 
or  read  anything  about  the  veto  power  in  the  United 
Nations  increased  by  only  3  percent  from  34  percent 
in  September  to  37  percent  in  March.  Although  the 
level  of  information  about  the  United  Nations  (as  mea-  . 
sured  by  the  NORC  questions)  had  not  changed  significantly 
during  the  six  months,  the  opinions  expressed  about  the 
United  Nations  did  change  somewhat.  For  instance,  dis¬ 
satisfaction  with  the  United  Nations  increased  from  28 


percent  of  the  sample  in  September  to  33  percent  of  the 
sample  in  March.  Similarly,  among  those  queried,  62  per¬ 
cent  felt  in  September  that  the  United  Nations  would 
succeed  while  only  48  percent  felt  this  way  in  March. ^ 

Therefore,  there  were  some  net  changes  in  opinions 
about  the  United  Nations;  moreover,  if  we  look  at  the 
turnover  within  various  subgroups  of  the  population,  the 
changes  will  be  of  differing  magnitudes  in  different 
groups  and  some  changes  will  be  larger  than  those  shown 
in  the  population  as  a  whole.  Therefore,  in  general, 
we  will  be  able  to  correlate,  or  attempt  to  correlate, 
frequencies  of  exposures  in  subgroups  of  the  population 
to  various  themes  with  the  turnover  or  changes  in  opinion 
during  the  six  month  period.  However,  it  should  be  noted 

a 

that  these  changes  are  not  the  largest  changes. 

By  far  the  most  important  changes  in  opinion 
during  the  six  months  had  to  do  with  the  kinds  of  problems 
which  were  facing  the  world  and  the  nation  during  this 
time.  The  main  concerns  were  problems  of  peace  or  war 
and  the  relations  of  the  United  States  vuth  Russia.  Fcr 
example,  when  the  sample  was  asked  to  think  of  the  prob¬ 
lem  facing  the  United  States,  24  percent  in  September 
and  46  percent  in  March  first  mentioned  the  possibility 

These  data  are  taken  from  NORC  Report  No.  37 A, 
Cincinnati  Looks  Again.  The  percentages  are  of  those  who 
have  some  minimal  knowledge  of  the  U.N.  People  who  knew 
nothing  about  che  U.N.  were  not  asked  these  questions. 
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of  another  war  or  the  problems  of  maintaining  the  peace. 
Sixteen  percent  in  September  and  29  percent  in  March  first 
mentioned  relations  with  Russia.  Only  1  percent  in  September 
and  2  percent  in  March  mentioned  the  United  Nations  in  answer 
to  this  question.  Another  question  asked  about  the  kinds  of 
problems  in  which  the  respondent  takes  a  keen  interest.  In 
September,  54  percent  of  those  asked  expressed  a  keen  inter¬ 
est  in  our  relations  with  Russia;  in  March,  this  was  68  per¬ 
cent.  In  September,  51  percent  expressed  a  keen  interest 
in  the  control  of  the  H-bomb;  in  March,  this  was  56  percent. 
When  asked  if  they  expected  the  United  States  to  fight  in 
another  war  within  the  next  ten  years,  in  September  48  per¬ 
cent  answered  "yes, "  and  in  March  73  percent  answered 
"yes".  Thus,  it  seems  clear  that  changes  in  public  opinion 
on  the  issues  of  war  and  peace  and  relations  with  Russia 
and  possibly  on  the  issue  of  the  control  of  the  atomic 
bomb  were  much  greater  and  more  important  than  changes 
in  opinion  about  the  United  Nations.  Therefore,  in  addi¬ 
tion  to  coding  the  newspaper  stories  in  terms  of  themes 
related  to  the  United  Nations,  we  also  coded  these  stories 
for  items  related  to  war,  violence  and  threats  to  peace, 
to  the  United  States'  relations  with  Russia,  and  to  con¬ 
trol  of  the  atomic  bomb.  In  fact,  it  will  be  shown  below 
that  the  largest  number  of  items  related  to  matters  of 
war  and  peace. 

The  number  of  themes  used  in  the  content  analysis 
totaled  17  including  chemes  having  to  do  with  U.N.  peace¬ 
keeping  or  the  veto  power  in  the  United  Nations,  the  U.N. 


and  human  rights,  etc.  With  these  kinds  of  themes,  it  is 
obvious  that  a  single  news  item  might  carry  more  than  one 
theme,  e.g.,  the  story  on  the  Palestine  debate  in  the 
United  Nations  might  emphasize  both  the  peacekeeping  role 
of  the  United  Nations  (Theme  1)  and  also  the  dissension 
and  dispute  among  the  great  powers  (Theme  2).  In  this 
case,  the  story  would  be  recorded  for  both  themes. 

The  themes  chosen  were,  therefore,  related  to  the 
questions  asked  in  the  NORC  survey.  It  was  felt  that 
these  themes  more  than  any  of  the  others  in  the  press  at 
this  time,  would  be  likely  to  explain  the  changes  in 
attitudes,  information,  and  opinions  during  the  six  month 
period  which  were  found  in  the  survey.  In  the  Appendix  to 
this  chapter,  we  present  the  themes  in  the  content  analysis, 
a  description  of  each  of  the  themes,  instructions  for  the 
coders,  the  kinds  of  newspaper  articles  which  might  be  in¬ 
cluded  as  relating  to  a  theme  and,  for  the  benefit  of  the 
coders,  a  list  of  questionnaire  items  and  the  results  from 
these  items  which  might  be  thought  to  be  closely  related 
to  each  of  these  themes.  Also  a  sample  coding  sheet  is 
included.  Since  there  exists  no  record  of  the  content  of 
radio  news  broadcasts  for  rhe  period,  the  coding  was  lim¬ 
ited  to  the  four  major  newspapers;  the  kinds  of  messages 
broadcast  will  be  inferred  from  the  treatment  in  the  news¬ 
papers  .  ~ 

Table  VII-1  shows  the  results  of  the  content  analy¬ 
sis  for  the  17  themes.  A  total  of  1,428  different  stories 
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were  coded  (including  the  stories  duplicated  for  the  relia¬ 
bility  estimation)  resulting  in  a  total  of  1,584  occurrences 
of  the  17  themes  in  the  four  newspapers.  Of  these,  the  most 
prominent  theme  by  far  was  the  theme  entitled  "Acts  of 
Violence,  Threats  to  Peace,  and  War."  Other  prominent 
themes  included  "U.N.  Peacekeeping,"  "Discussions  of 
Russian  and  American  Relations,"  "Dissension  Among  the 
Great  Powers  in  the  U.N."  and  "Mention  of  the  Organiza¬ 
tions  Sponsoring  the  Information  Campaign."  Given  the 
relative  number  of  occurrences  of  each  of  the  themes, 
it  is  not  surprising  to  find  that  fears  of  war  and  inter¬ 
est  in  Russian-American  relations  were  among  the  areas 
showing  the  greatest  change  in  the  panel.  With  reference 
to  the  negligible  change  in  information  concerning  the 
veto  power,  we  note  that  there  were  very  few  mentions  of 
the  veto  power  in  the  press,  (The  category  "Any  Reference 
to  the  Veto  Power, "  which  one  would  assume  would  be  larger 
than  the  following  two  categories  which  specify  something 
particular  about  the  veto  power,  actually  is  smaller  than 
the  other  two  categories.  This  probably  reflects  the  fact 
that  the  coder  had  a  much  easier  time  in  keeping  in  atten¬ 
tion  specific  kinds  of  coding  and  would  likely  code  one  of 
the  particular  kinds  of  veto  power.  Nonetheless,  even 
with  this  kind  of  inaccuracy,  the  mention  of  the  veto 
power.,  or  of  other  areas  of  concern  to  the  U.N.  were  very 
rare  in  the  press  at  this  time.)  Also,  we  note  that  there 
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Table  VII-1.  Content  Analysis  for  Seventeen  Themes  Occurring 
in  Cincinnati  Newspapers,  September  16,  1947  -  March  7,  1948. 


No.  of  News-  Est.  No.  of  Total 

Themes  Coded  paper  Occur-  Radio  Occur-  No.  of 

rences  rences  Occur¬ 

rences 
of  the 

_  Theme 


Themes  Relating  to  the  U.N, 


1.  U.N.  Peacekeeping 

234 

361 

595 

2.  Great  Power  Dissension 

in  the  U.N. 

108 

249 

357 

3.  Any  Reference  to  the  Veto  Power 

22 

32 

54 

4.  Veto  Only  in  Security  Council 

27 

63 

90 

5.  Veto  Only  by  Great  Powers 

28 

84 

112 

6.  U.N.  and  Human  Rights 

20 

12 

32 

7.  U.N.  and  World  Trade 

5 

11 

16 

8.  U.N.  and  World  Health 

9 

7 

16 

9.  UNESCO  and  Health 

14 

0 

14 

10.  Slogan  of  the  Information 

Campaign 

0 

1 

11.  Mention  of  the  Information 

Campaign  Sponsors 

100 

42 

142 

12.  Explicit  Explanation  of 

the  U.N. 

31 

12 

43 

13.  Satisfaction  with  the  U.N. 

19 

44 

63 

14.  Dissatisfaction  with  the  U.N. 

17 

58 

75 

Themes  Relating  to  Other  International  Issues 


15.  Control  of  the  Atomic  Bomb 

91 

129 

220 

16.  Russian-American  Relations 

17.  Violence,  Threats  to  Peace, 

217 

535 

752 

War 

643 

1559 

2202 

Total 

(514) a 

1584 

(1247) a 

(1761) 

aThe  simulation  allows  for  a  maximum  of  1800  occurrences  of 
any  single  theme;  therefore,  one  of  every  five  of  the  occur¬ 
rences  of  the  last  theme  were  omitted  in  constructing  the 
scenario  of  messages.  The  values  in  parentheses  are  the 
reduced  scenario  values. 
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were  very  few  expression's  of  satisfaction  or  dissatisfac¬ 
tion  with  the  United  Nations.  One  might  assume  that  in 
this  case  the  coders  failed  to  categorize  in  this  manner 
messages  which  implicitly  carried  notions  of  satisfaction 
or  dissatisfaction  with  the  U.N. ;  one  might  imagine  here 
that  discussions  of  great  power  dissension  in  +■'  2  U.N.  or 
reports  of  such,  carry  with  them  an  implied  attitude  of 
dissatisfaction  with  the  U.N.  The  coders,  however,  did 
not  make  this  assumption.  Finally,  we  note  that  there 
were  a  total  of  31  occurrences  in  the  newspaper  of  stories 
containing  some  explicit  explanation  of  the  purposes,  work¬ 
ings,  or  functioning  of  the  U.N.  It  seems  quite  obvious 
that  although  the  mention  of  meetings  of  the  sponsors  of 
the  campaign  were  often  carried  in  the  newspaper,  the 
campaign  did  not  succeed  in  significantly  changing  the 
number  of  stories  in  the  newspaper  which  conveyed  some 
explicit  explanation  of  the  United  Nations.  (Even  had 
they  been  able  to  do  so,  these  stories  would  not  have  been 
widely  read,  especially  by  those  who  had  little  knowledge.) 
This  makes  much  less  surprising  the  finding  of  very  small 
changes  in  opinion  and  attitudes  and  knowledge  about  the 
U.N.  over  the  six-month  period.  (Since  no  more  than  six¬ 
teen  messages  were  coded  for  themes  7,  8,  9,  or  10,  these 
themes  were  not  included  in  the  simulation  scenarios.) 

The  Cincinnati  campaign  may  not  have  demonstrated 
the  difficulty  in  reaching  certain  segments  of  the  popula¬ 
tion  with  information  about  public  affairs  (although  this 
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may  be  true  enough) ;  what  it  seems  actually  to  have  demon¬ 
strated  is  the  difficulty  in  presenting  information  and 
explicit  explanation  about  some  international  issue  in 
the  mass  media.  The  campaign  sponsors  thought  they  had 
done  this;  however,  the  content  analysis  seems  to  indicate 
that  they  did  not  succeed  nearly  as  well  as  they  had 
imagined  in  getting  the  information  into  the  mass  media. 

Because  it  is  only  possible  to  calculate  duplica¬ 
tion  between  any  theme  and  the  first  three  themes  of  the 
simulation,  the  order  of  the  themes  in  the  running  of  the 
simulation  was  changed  to  place  the  most  important  themes 
first.  The  actual  order  of  themes  in  the  scenario  (and 
so  presented  in  Chapter  VIII  analyzing  the  output  statis¬ 
tics,  except  for  the  final  theme  which  was  lost  due  to  a 
programming  error)  follows: 

1.  U.N.  Peacekeeping 

2.  Russian-American  Relations 

3.  Violence,  Threats  to  Peace,  War 

4.  Great  Power  Dissension  in  the  U.N. 

5.  Any  Reference  to  the  Veto  Power 

6.  Veto  Only  in  Security  Council 

7.  Veto  Only  by  Great  Powers 

8.  U.N.  and  Human  Rights 

9.  Mention  of  the  Information  Campaign  Sponsors 

10.  Explicit  Explanation  of  the  U.N. 

11.  Satisfaction  with  the  U.N. 

12.  Dissatisfaction  with  the  U.N. 

13.  Control  of  the  Atomic  Bomb 
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Coding  Reliability 

In  the  coding  of  the  content  analysis  of  the 
Cincinnati  newspapers,  two  coders  read  the  total  issues 
of  the  three  newspapers,  the  Enquirer  (both  daily  and 
Sunday) ,  The  Times-Star,  and  the  Post.  In  order  to  test 
txie  reliability  of  the  analysis,  each  of  the  coders  was 
required  to  code  each  of  three  one-week  periods.  These 
periods  were  the  first  week  in  November,  the  first  week 
in  December,  and  the  first  week  in  January.  However, 
several  thorny  difficulties  ari^  in  making  a  calculation 
of  the  reliability  for  this  kind  of  content  analysis. 

The  first  of  these  is  that  there  is,  unlike  the  case  of 
coding  survey  responses,  no  known  number  of  items  which 
must  be  coded,  i.e.,  the  number  of  items  relevant  to  any 
of  the  seventeen  themes  in  the  content  analysis  must  be 
decided  upon  by  the  judgment  of  the  coders  and  is  not 
known  beforehand.  Therefore,  we  have  decided  to  use  as 
a  base  the  total  number  of  stories  coded  i  ~  ese  thi 
weeks  by  the  two  coders,  realizing  that  some  stories  may 
have  escaped  the  notice  of  both  of  the  coders  and  so  tho 
"N"  may  be  larger  than  the  "N"  given  below.  However,  it 
would  seem  that  the  most  important  and  the  most  salient 
stories  in  terms  of  our  seventeen  themes  would  probably 
be  caught  by  at  least  one  of  the  two  coders. 

The  total  number  of  stories  coded  by  the  two 
coders  in  the  three  weeks  in  three  newspapers  was  240  stories. 
Of  these  stories,  19  34  percent  were  coded  by  Coder  No  .land 


149  or  62  percent  of  the  stories  were  coded  by  Coder  No.  2. 
The  number  of  stories  in  common  between  the  two  coders  was 
102  or  42.5  per  cent.  Because  Coder  No.  1  seemed  to  find 
significantly  more  stories  relevant  to  the  themes  than 
Coder  No.  2,  whenever  both  coders  coded  a  newspaper,  the 
coding  of  Coder  No.  1  was  used.  Of  the  total  number  of 
issues  of  newspapers  coded.  Coder  No.  1  coded  53  percent 
in  the  data  used  for  our  total  content  analysis.  Also, 
in  interviewing  the  Cincinnatians  involved  in  the  Cincinnati 
Plan,  these  people  indicated  that  the  Cincinnati  Post  was 
the  most  important  newspaper  in  terms  of  dissemination  of 
news  about  the  U.N.  and  international  affairs  in  Cincinnati. 
Therefore,  Coder  No..  1  was  assigned  to  this  newspaper  and 
coded  100  percent  of  the  issues  of  this  paper.  Coder  No.  1 
also  coded  47  per  cent  of  the  Times-Star  issues  and  only  12 
per  cent  of  the  Enquirer  issues.  Considering  the  proportion 
of  issues  coded  by  Coder  No.  1  and  the  relative  percentages 
of  the  total  number  of  stories  which  were  coded  by  each  of 
the  interviewers,  we  estimate  that  about  75  percent  of  the 
stories  relevant  to  our  seventeen  themes  were  coded  by  the 
combined  coding  operation.  Therefore,  25  percent  of  the 
relevant  stories  were  probably  not  coded  and  were  not  in¬ 
cluded  in  the  scenarios.  We  would  guess  that  these  stories 
which  were  missed  were,  however,  less  important  and  less 
obviously  salient  to  the  themes  than  the  stories  which  were 
coded.  Nevertheless,  the  reader  may  keep  in  mind  the  fact 
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that  perhaps  25  percent  c.:  the  stories  that  have  some 
sort  of  relevance  to  each  of  these  themes  were  not,  in 
fact,  included  in  the  coding  or  the  scenarios. 

A  second  major  problem  in  evaluating  the  relia¬ 
bility  of  the  coding  operation  is  the  fact  that  the 
categories  into  which  each  story  was  coded  were  not 
mutually  exclusive,  i.e.,  a  story  might  very  well  con¬ 
tain  several  of  the  themes  of  the  content  an? lysis.  For 
example,  it  would  not  be  uncommon  for  a  single  story  to 
contain  both  the  themes  of  the  U.N.  peacekeeping  and  of 
dissatisfaction  with  the  U.N.  1516  formula  recommended 
for  estimating  the  reliability  of  a  coding  operation  is 
Scott's  formula  for  Pi,  which  measures  the  ratio  of  the 
actual  difference  between  the  observed  and  expected  agree¬ 
ments  between  coders  to  the  maximum  dif fere-  ce  between  the 

» 

observed  and  expected  agreement.^  Thus,  Scctt  s  Pi  is  a 
measure  of  coder  reliability  which  varies  between  the 
limits  of  0.0  and  1.0.  However,  Scott's  Pi  assumes  that 
each  of  the  coding  operations  produces  one  and  only  one 
coding  assignment  for  each  story.  If  we  disregard  this 
assumption,  we  are  essentially  throwing  out  the  fact  that 
not  coding  a  story  as  relevant  to  a  particular  theme  by 


c  William  A.  Scott,  "Reliability  of  Content  Analysis; 
a  Case  of  Nominal  Scale  Coding, "  Public  Opinion  Quarterly. 
(Fall  1955),  321-25. 
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both  coders  is  an  instance  of  agreement.  At  any  rate,  dis¬ 
regarding  this  fact,  the  value  for  Scott's  Pi  is  .54  indica¬ 
ting  only  a  low  degree  of  rel’ ability  in  the  coding  of  the 
Cincinnati  papers. 

There  is  another  way  to  look  at  the  reliability 
of  the  coding,  a  way  which  ha3  faults  but  may  nevertheless 
be  instructive  to  the  re*«<7.er.  We  might  look  at  the  total 
number  of  themes  which  •  jrc  assigned  to  the  total  number 
of  stories  which  were  coded  in  common  and  the  number  of 
these  which  were  in  agreement.  (This  measure  is  biased 
by  the  fact  that  if  we  have  only  two  possible  themes  (i.e., 
coding  categories)  we  have  a  high  probability  of  agreement, 
and  as  ve  have  more,  i.e.,  seventeen  themes  to  assign,  we 
have  much  less  possibility  of  agreement  simply  on  the  basis 
of  chance.)  This  measure  shows  63.3  percent  of  the  coding 
choices  in  agreement.  We  might  also  look  at  the  individual 
themes  for  the  proportion  of  coding  in  agreement.  The  most 
important  themes  (in  terms  of  number  of  stories)  were 
'Themes  Nos-  1,  2,  11,  15,  16,  and  17.  For  these  themes, 
the  proportion  of  the  total  choices  of  the  theme  which 
were  made  by  both  of  the  coders  are:  Theme  1-50  percent. 
Theme  2-33  percent.  Theme  11  -  89  percent.  Theme  15  - 
89  percent.  Theme  16  -  40  percent,  and  Theme  17  (which 
was  by  far  the  most  common  theme)  -  89  percent.  Judging 
from  the  kinds  of  information  contained  in  each  of  these 
themes,  it  seems  clear  that  the  coders  had  a  high  degree 
of  agreement  about  such  things  as  threats  to  peace  or 
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wars  (violence)  but  for  such  other  less  clear-cut  topics 
as  dissatisfaction  with  the  U.N.  or  the  U.N.  and  world 
health,  the  agreement  was  much  lower. 

We  may  summarize  our  findings  about  the  relia¬ 
bility  of  the  content  analysis  in  this  manner;  there  was 
relatively  high  reliability  in  coding  the  stories  relevant 
to  one  or  another  theme  in  tl:^  content  analysis.  For  cer¬ 
tain  themes,  the  agreement  between  the  coders  on  the 
relevance  of  the  story  for  the  theme  was  very  high. 

These  themes  were  Themes  Nos.  11,  15,  and  17,  which  are 
the  "Campaign  Sponsors  for  the  U.N. , "  "Control  of  the 
Atomic  Bomb,"  and  "Threats  to  Peace*  Wars."  The  content 
analysis  could  surely  have  been  improved  with  more  and 
better  training  of  the  analysts.  One  of  the  obvious 
problems  was  that  Coder  No.  2  did  not  attend  to  stories 
of  possibly  lesser  significance  for  the  seventeen  themes. 

While  there  is  rcom  for  improvement  in  this  con¬ 
tent  analysis,  it  nevertheless  seems  reasonably  accurate 
as  input  for  the  simulation  scenarios,  especially  in 
light  of  the  level  of  precision  in  specifying  the  various 
parameters  of  the  media  system,  and  the  kinds  of  assumptions 
necessary  to  generate  the  radio  messages. 

Estimation  of  the  Radio  Messages 

The  second  column  in  Table  VII-1  contains  the  number 
of  radio  occurrences  of  each  of  the  themes.  There  was  no 
way  to  directly  ascertain  what  kind  of  messages  were  broadcast 


.  '  VT*-  y**rT* 


mr^ 


242 

during  the  six-month  period  of  their  distribution  through¬ 
out  the  media  system.  Therefore,  some  estimate  had  to  be 
made  on  the  basis  of  the  kinds  of  stories  that  were  carried 
in  the  newspapers.  This  was  done  in  the  following  manner; 
first,  it  was  assumed  that  there  was  a  correspondence  be¬ 
tween  the  messages  appearing  in  the  morning  and  early  after¬ 
noon  newspapers,  i.e.,  in  the  daily  Enquirer,  and  the  radio 
news  during  the  morning  and  early  afternoon.  Likewise,  we 
assume  that  there  was  a  correspondence  between  the  news  in 
the  two  afternoon  newspapers,  the  Times-Star  and  the  Post, 
and  the  afternoon  and  evening  radio  news  broadcasts. 

1  inally,  we  assume  that  the  Sunday  news  broadcasts  reflected 
the  news  in  the  Sunday  Enquirer.  However,  the  amount  of 
time  devoted  to  international  news  in  news  broadcasts  is 
quite  limited  and  not  nearly  ail  of  the  stories  or  themes 
found  in  the  newspaper  would,  in  fact,  be  found  in  the 
corresponding  radio  broadcast.  Therefore,  we  used  the 
ad  hoc  formula  that  only  if  a  story  appeared  on  the  front 
page  of  the  newspaper  would  it  appear  in  the  corresponding 
radio  broadcast,  on  this  basis,  we  estimated  the  occurrence 
of  the  themes  in  the  radio  broadcasts  and  tlese  are  the 
numbers  which  are  presented  in  the  second  column  of 
Table  VII -1.  It  will  be  noted  that  in  general  these 
numbers  are  much  larger  than  the  number  of  occurrences 
in  the  newspaper  even  though  they  reflect  only  those 
newspaper  occurrences  which  are  found  on  the  front  page. 

This  is  because  in  general  one  front-page  newspaper 
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newspaper  occurrence  was  likely  to  generate  from  fi,Te  to 
ten  radio  occurrences,  one  at  each  hourly  time  slot  when 
the  news  broadcast  took  place.  Thus,  the  mention  of  one 
of  the  themes  on  the  front  page  of  the  daily  Enquirer 
would  imply  that  the  theme  was  broadcast  in  the  7:00  A.M. , 
8:00  A.M. ,  9:00  A.M. ,  10:00  A.M. ,  11:00  A.M.,  12:00  A.M., 
1:00  P.M.,  and  2:00  P.M.  news  broadcasts.  It  is  true  in 
some  cases  that  the  number  of  radio  occurrences  is  less 
than  the  number  of  newspaper  occurrences  or  nearly  equal 
to  the  number  of  newspaper  occurrerces.  This  is  an  indica¬ 
tion  that  most  of  the  newspaper  occurrences  did  not  take 
place  on  the  front  page  of  the  newspaper?  that  is,  the 
stories  were  not  considered  most  important  news.  These 
stories  are  generally  just  what  we  would  expect,  i.e., 
themes  relating  to  the  U.N.  and  human  rights,  the  U.N.  . 
and  world  trade  or  world  health,  UNESCO,  slogan  of  the 
information  campaign,  explicit  explanation  of  the  U.N. , 
or  references  to  the  veto  power.  Thus,  the  relative  in¬ 
crease  in  number  of  radio  occurrences  over  the  number  of 
newspaper  occurrences  can  be  taken  as  an  indication  of 
the  relative  importance  of  each  occurrence  of  the  theme, 
i.e.,  its  location  on  the  front  page. 

There  is  one  other  note  of  explanation  about  the 
table.  In  the  simulation,  a  maximum  of  1,800  occurrences 
of  any  single  theme  is  allowed;  therefore,  it  was  necessary 
to  somehow  cut  down  the  number  of  stories  or  occurrences  of 
the  final  theme  which  totaled  2,202  occurrences  combining 
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both  newspaper  and  radio.  We  did  this  in  a  rather  arbitrary 
fashion,  simply  discarding  one  out  of  every  five  of  the 
occurrences  of  the  last  theme.  Although  at  this  stage  c*' 
the  simulation,  the  effect  of  this  paring  down  of  the  num¬ 
ber  of  messages  is  not  obvious,  we  shall  find  upon  examina¬ 
tion  of  several  features  of  the  output  statistics  (in 
Chapter  VIII),  that  most  probably  the  effect  is  to  diminish 
the  overall  average  level  of  exposure,  but  not  to  change 
signifxcantiy  the  distribution  of  exposure  over  the  popula¬ 
tion  types.  The  figures  resulting  from  this  paring,  plus 
the  figures  for  the  other  themes,  are  the  basis  for  the 
scenario  inputs  to  the  exposure  part  of  the  simulation. 

It  is  possible  that  there  may  be  some  exaggeration 
in  the  number  of  radio  stories  created  by  this  estimation 
procedure  because  of  the  possibility  of  the  same  story 
with  identical  themes  being  carried  on  the  front  pages 
of  both  of  the  evening  newspapers.  In  this  case,  the 
story  would  appear  twice  in  each  of  the  radio  news  broad¬ 
casts  corresponding  tc  the  afternoon  newspapers.  This 
kind  of  duplication  appears  unavoidable  since  it  would  be 
very  complex  to  detect  from  the  coding  whether  the  exact 
story  in  the  same  form  was  presented  in  both  of  the  evening 
newspapers.  In  order  to  check  the  amount  of  error  possible 
here,  we  have  calculated  the  total  number  of  front  page 
stories  appearing  in  the  Times-Star  and  the  Post.  In  the 
Times -Star,  this  total  is  50  and  for  the  Post  the  total  is 


128.  Therefore,  the  largest  number  of  stories  which  could 


possibly  be  erroneously  duplicated  in  the  radio  news  broad¬ 
casts  is  50  and  more  than  likely  it  is  significantly  small¬ 
er  than  50.  (The  figure  50  would  be  true  if  each  of  the  50 
stories  that  appeared  on  the  Times-Star  front  page  also  ap¬ 
peared  on  the  Post  front  page  carrying  the  same  themes.) 
Therefore,  we  conclude  that  there  is  a  likelihood  of  some 
exaggeration  in  the  number  of  evening  radio  occurrences  of 
the  themes,  but  that  it  is  relatively  small.  Given  the 
underestimation  of  the  total  number  of  occurrences  of  themes 
in  the  newspaper  (we  estimated  above  that  about  75  percent 
of  the  stories  which  could  have  been  coded  were  actually 
coded)  this  error  is  even  less  important. 

The  Estimation  of  Newspaper  Message  Exposure  Probabilities 
The  simulation  up  to  this  point  has  produced  prob¬ 
abilities  of  exposure  to  the  various  vehicles  in  the  mass 
media  system.  If  we  were  to  run  a  simulation  based  on  these 
probabilities,  the  resulting  exposures  would  be  exposures  to 
the  vehicle.  However,  what  we  want  is  the  exposure  of  peo¬ 
ple  to  messages  contained  in  the  vehicles.  For  this  expo¬ 
sure  we  need  some  estimate  of  the  probability  of  exposure 
of  an  individual  to  a  message  in  a  vehicle,  given  that  the 
individual  has  been  exposed  to  the  vehicle.  Then  the  prob¬ 
ability  that  the  individual  is  exposed  to  a  particular 
message  in  a  vehicle  is  just  the  product  of  the  two  prob¬ 
abilities.  Let  us  call  the  probability  >f  exposure  to  a 
message  in  a  vehicle,  given  exposure  to  the  vehicle,  a 
message  exposure  probability. 


'wm 


Exposure  to  the  message  depends  upon  two  factors; 
the  interest  of  the  reader  in  the  topic  and  the  play 
which  the  message  gets  in  the  format  of  the  medium.  If 
for  each  theme  in  the  simulation,  we  had  a  large  number 
of  stories  of  differing  formats,  with  an  estimate  of  the 
proportion  of  the  vehicle  audience  who  were  exposed  to 
each  story,  then  we  could,  in  principle,  separate  out 
the  effects  of  formating  from  the  effects  of  the  content 
of  the  theme  upon  the  message  exposure  probabilities. 
Unfortunately,  we  have  not  been  able  to  find  a  large 
sample  of  news  stories  for  each  of  the  themes  and  for 
the  1947-1948  time  period  for  which  these  exposure  data 
are  available.  We  have  found  127  stories  relating  to 
general  international  news  for  which  these  data  are 
available;  from  them  we  have  estimated  (using  multiple 
regression)  the  effects  of  format  on  the  conditional 
message  exposure  probabilities  for  international  news. 

The  basis  for  the  estimation  of  message  exposure 
probabilities  is  a  study  done  by  the  Advertising  Research 


Foundation  called  The  Continuing  Study  of 


r  Readin 


which  is  actually  a  series  of  138  studies  of  different 
newspapers  dating  from  1939  to  1950.  The  aim  of  this 
series  of  studies  was  to  look  at  the  readership  of 
various  kinds  of  articles  in  newspapers  given  that  a 
person  had  read  at  all  in  the  newspaper.  We  have  used 
the  data  on  international  news  stories  in  the  1947-1948 
studies  for  a  multiple  regression  predicting  readership 
of  men  and  women  based  on  various  format  factors. 


Each  of  the  studies  was  conducted  in  the  following 
manner.  A  quota  sample  was  selected  representing  various 
economic  levels  in  the  city  zone  of  the  newspaper  in  their 
proper  proportion  and  paralleling  (geographically)  the 
circulation  of  the  newspaper.  This  sample  consisted  of 
225  men  and  225  women  who  were  admitted  readers  of  the 
previous  day's  issue  of  the  newspaper.  When  the  inter¬ 
viewer  contacted  a  respondent  according  to  this  quota 
sampling  procedure,  and  the  respondent  had  read  at  all 
in  the  previous  day's  issue  of  the  paper,  the  interviewer 
then  presented  the  respondent  with  a  copy  of  that  issue 
of  the  paper  and  asked  the  respondent  to  go  through  the 
newspaper  page  by  page  marking  with  crayon  the  articles, 
pictures,  and  advertisements  which  he  remembered  having 
read  the  previous  day.  The  interviewers  were  instructed 
to  use  only  two  questions  in  their  interviewing.  The 
first  was,  "Did  you  happen  to  see  or  read  anything  on 
this  page?"  and  the  second,  "Did  you  happen  to  see  or 
read  any  of  the  advertisements  on  this  page?"^ 

The  data  from  these  450  interviews  is  presented 
in  an  exact  copy  of  the  issue  of  the  newspaper,  printed 

o 

A  summary  of  the  studies,  the  findings,  and  the 
methodology  is  presented  in  The  Continuing  Study  of  News¬ 
paper  Reading:  138  -  Study  Summary,  The  Advertising 
Research  Foundation  (New  York:  Advertising  Research 
Foundation,  Inc.,  1951). 


in  a  large  journal  form.  Superimposed  on  each  article, 
advertisement,  picture,  or  cartoon  is  the  proportion  of 
men  and  the  proportion  of  women  who  reported  having  read 
the  article.  We  have  then,  for  each  article  in  the  news¬ 
paper,  the  proportion  of  men  and  the  proportion  of  women 
who  admit  to  having  read  the  art.icle,  given  that  they  have 
been  exposed  to  the  vehicle  at  all.  This  is  the  basis  for 
our  estimates  of  message  exposure  probabilities.  (Since 
exposure  data  are  reported  only  by  sex,  this  is  the  only 
demographic  variable  considered  in  calculating  message 
exposure  probabilities.) 

Of  the  138  studies.  Studies  No.  110  through  120 
occurred  during  the  time  period  from  June  19,  1947,  to 
June  3,  1948.  This  is  a  total  of  eleven  studies  of 
which  nine  are  evening  papers  and  two  are  morning  papers. 

A  list  of  these  papers,  the  dates,  study  numbers,  and 
circulations,  is  given  below.  As  can  be  seen  from  the 
table,  the  newspapers  range  in  size  from  the  Athens 
Messenger.  (Athens,  Ohio)  with  a  circulation  of  18,805 
to  the  Washington  Post  (Washington,  D.C.)  with  a  circula¬ 
tion  of  165,554. 

From  these  11  newspapers  a  total  of  127  stories 
dealing  with  international  political  news*  were  coded  for 

*  Typical  titles  for  these  stories  included i  "Vandenberg 
Plan  Appears  Assured  of  U.S.  Blessing,"  "Russian  Soldiers  Out¬ 
number  Yanks  by  20  To  1  Ratio,"  "My  Day"  by  Eleanor  Roosevelt, 
and  "Pravda  Assails  Acheson's  Speech  as  Rude  Slander.” 


VII-2.  Newspaper  Studies  from  Which  International  News  Items  were  Coded 
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several  format  characteristics  and  the  percentages  of  men 
and  the  percentages  of  women  exposed.  One  set  of  variables 
used  to  describe  the  story  formats  referred  to  the  page 
location  of  the  story.  The  categories  were  (1)  front  page 
lead  story,  (2)  other  front  page,  (3)  second  or  third  page, 
(4)  editorial  page  or  pages,  (5)  women's  page  or  pages, 

(6)  other  inside  pages.  Other  variables  described  whether 
the  story  was  in  print  or  a  picture  or  both,  whether  it 
was  located  at  the  top  of  the  page,  the  middle,  or  the 
bottom  of  the  page,  the  caption  width  in  columns,  and  the 
number  of  column  inches  in  length.  (Of  course,  for  all 
except  the  last  two  of  these  variables  the  coding  was  a 
1  or  a  0  as  the  variables  are  actually  dummy  variables.) 

Then  the  linear  regression  was  run  in  a  stepwise  fashion, 
independent  variables  being  added  one  at  a  time  in  the  order 
of  their  contribution  to  the  explanation  of  the  variance  in 
the  dependent  variable.  In  this  way  the  most  important  of 
the  independent  variables  were  isolated. 

Results  of  the  Multiple  Regression 

First  we  examine  the  mean  exposure  probabilities 
for  men  and  for  women  for  the  127  stories.  For  the  men 
the  average  exposure  was  21.6  percent  with  a  standard 
deviation  of  13.49  percent.  For  the  women  the  mean 
exposure  was  13.0  percent  with  a  standard  deviation  of 
11.12  percent.  These  numbers  are  validated  somewhat  by 
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a  study  done  by  Swanson  which  is  a  summary  of  all  138  studies 
xn  the  Continuing  Study .  Swanson  found  the  following  summary 
readership  figures  fox  various  categories  of  articles.  For 
war  news,  the  mean  male  readership  was  39.4  percent  and  the 
mean  female  readership  was  29.8  perc*-  t;  for  defense  news? 
males  30.8  percent,  females  27.4  percent;  for  economic-social 
international  relations  news,  male  readership  22.1  percent, 
female  readership  18.1  percent;  for  political  international 
relations  news,  male  readership  24.0  percent,  female  reader- 
ship  15.5  percent;  for  general  politics  news,  male  reader- 
ship  19.6  percent,  female  readership  14.0  percent.  The  dif¬ 
ferences  between  Swanson's  figures,  which  cure  somewhat  higher 
than  the  means  in  the  127  articles,  probably  come  from 
changes  over  time  and  the  categorization  of  the  articles. 

At  any  rate,  Swanson's  figures  for  general  international 
news  are  not  widely  different  from  the  figures  for  the 
stories  recorded  during  the  year  from  June  1947  to  June 
1948. 

In  the  table  below  we  present  the  result  of  the 
stepwise  linear  regression.  For  men  we  have  used  a  re¬ 
gression  equation  with  six  variables  and  a  constant 
which  has  a  multiple  R  correlation  coefficient  of  .6900 

^ Charles  E.  Swanson,  "What  they  Read  in  130  Daily 
Newspapers,"  Journalism  Quarterly,  32:4,  (Fall,  1955), 


411-421. 
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(significant  at  the  .0000+  level)  and  which  indicates  that 
the  format  factors  explain  about  48  percent  of  the  variance 
in  the  dependent  variable,  the  proportion  of  men  reading  the 
article.  For  women  seven  independent  variables  were  included 
which  together  produce  a  multiple  R  of  .7267  (which  is 
significant  at  the  .0000+  level  of  probability)  and  which 
account  for  approximately  52  percent  of  the  variance  in 
the  female  readership.  We  note  also  that  there  are  several 
rather  plausible  differences  between  the  men  and  women  in 
those  independent  variables  which  contribute  most  to  the 
prediction  of  readership.  The  location  of  the  article  on 
either  the  front  page  or  the  editorial  pages  is  important 
in  prediction  of  male  readership  but  not  for  female  reader- 
ship;  however,  the  location  on  the  second  or  third  pages, 
other  inside  pages,  on  women's  pages,  or  the  presence  of 
a  picture  is  important  in  prediction  of  female  readership 
but  not  for  male  readership.  These  equations  were  then 
used  to  predict  the  readership  of  the  stories  actually 
coded  in  the  content  analysis  of  the  Cincinnati  press  on 
the  basis  of  the  formating  of  the  stories. 

At  this  point  perhaps  it  is  appropriate  to  comment 
on  the  significance  of  these  equations.  It  seems  reason¬ 
able,  especially  given  the  fact  that  there  are  no  other 
methods  available,  to  use  these  equations  to  predict  the 
readership  for  men  and  women  of  the  Cincinnati  news 
stories.  The  equations  were  derived  from  data  which  is 
in  many  respects  similar  to  the  Cincinnati  news  stories 
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both  in  the  content  and  in  the  time  of  the  story.  Although 
we  cannot  be  sure  that  the  population  of  news  stories  of 
which  our  127  are  a  sample  is  equivalent  to  the  population 
of  news  stories  in  the  Cincinnati  newspapers,  for  present 
purposes  we  are  willing  to  make  this  assumption. 

Another  factor  is  quite  obvious  from  tha  statis¬ 
tics  of  the  regression.  The  format  factors,  at  least  as 
they  are  characterized  in  the  present  content  analysis, 
serve  to  explain  only  half  of  the  variance  in  the  reader- 
ship  figures  for  men  and  women.  We  might  ask,  then,  what 
additional  variables  would  be  necessary  to  more  completely 
specify  this  readership.  In  reading  and  coding  these 
stories,  it  became  obvious  to  the  author  that  certain  phrases 
or  words  in  the  caption  of  the  stories  serve  to  attract  the 
attention  of  the  eye  and  increase  the  likelihood  of  reading 
the  story.  For  the  author  (a  male)  words  such  as  "seize, " 
"threaten, "  or  phrases  such  as  "Russian  Soldiers  Outnumber, " 
or  "Bigger  Atom  Bombs"  served  to  attract  attention.  We 
theorize  then  that  much  of  the  remaining  variance  in  the 
readership  cf  these  stories  is  dependent  upon  the  effects 
of  certain  key  words  or  phrases  in  the  captions  upon  the 
audience.  One  might  try  to  design  an  experiment  which  would 
try  to  separate  the  effects  of  these  kinds  of  phrases  from 
the  formating,  etc.,  but  at  first  glance  this  seems  to  be  a 
very  difficult  research  problem.  There  is,  however.,  another 
way  to  attack  the  problem..  One  might  imagine  that  the  edi¬ 
tors  or  writers  who  °et  the  captions  for  the  stories  are 
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very  astute  at  picking  phrases  and  words  which,  given  the 
content  of  the  story,  will  best  attract  the  readership  of 
the  audience.  Thus  one  might  be  willing  to  make  the  assump¬ 
tion  that  the  content  of  the  story  will  be  represented  in 
the  most  favorable  way  in  the  caption  for  the  story  and 
that  a  description  of  the  effect  of  the  content  of  the 
story  itself  would  go  a  long  way  toward  explaining  the 
effects  of  certain  of  these  key  words.  This  is  an  area 
for  further  research  and  we  have  not  seen  any  research  on 
this  kind  of  effect  in  any  of  the  journals.6  At  any  rate, 
it  is  obvious  that  even  within  the  limited  message  contents 


6  The  key  words  or  phrases  in  the  caption  obviously  have 
an  effect  upon  the  readership  of  the  article  but  also  the  con¬ 
tent  of  the  article  has  an  effect  upon  the  words,  phrases,  that 
appear  in  the  caption.  One  possible  model  might  assume  an 
average  level  of  attention  due  to  the  content  of  the  article 
and  perhaps  a  more  specific  additive  effect  upon  attention  if 
the  content  of  the  article  allows  the  use  of  certain  kinds  of 
words  or  phrases  which  are  superior  attention-getting  words 
and  phrases.  Also  it  would  seem  likely  that  the  attention 
paid  to  a  given  word  or  phrase  by  the  reader  depends  upon  the 
characteristics  of  the  reader,  in  particular  upon  his  sex, 
education  and  quite  likely  some  dimensions  of  personality. 

In  much  the  same  vein,  Tannenbaum  and  Lynch  differentiate 
between  two  dimensions  in  their  conception  of  sensationalism, 
i.e.,  topical  and  stylistic  sensationalism.  They  present  a 
factor  analysis  of  twenty  semantic  differential  scales  which 
identifies  the  evaluative ,  the  excitement,  and  the  activity 
factors  underlying  the  concept.  However,  they  make  no  attempt 
to  isolate  the  effects  of  single  words  or  phrases,  or  to  deter- 
B  mine  if  interaction  exists  with  personality  variables  in  the 

reader.  See  Percy  H.  Tannenbaum  and  Mervin  D.  Lynch,  "Sensa¬ 
tionalism:  The  Concept  And  Its  Measurement;"  Journalism 
Quarterly,  37:3  (1960),  p.  381-391. 
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and  themes  of  the  present  analysis  there  seems  to  be  enough 
variety  of  content  or  theme  and  variety  of  attention  gather¬ 
ing  expressions  and  phrases  in  the  captions  of  the  messages 
so  as  to  limit  to  about  50  percent  the  amount  of  variance 
in  readership  Which  can  be  explained  on  the  basis  of  format 
factors  alone.7 

Generation  of  Exposure  Probabilities  for  Occurrences  of  the 
Themes  in  Radio  News  Broadcasts 

Por  any  occurrence  of  a  theme  in  the  newspaper,  we 
are  able  to  generate  a  probability  of  exposure  based  on 
the  multiple  regression  developed  above.  Every  newspaper 
message  is  assigned  a  probability  of  exposure  for  men  and 
for  women  from  this  procedure.  The  next  problem  is  to 
estimate  these  probabilities  of  exposure  for  the  radio 
occurrences  of  these  messages.  We  did  this  by  making 
some  assumptions  about  consistency  between  the  appear¬ 
ance  of  the  stories  in  the  newspaper  and  the  length  and 
emphasis  they  would  be  given  in  a  news  broadcast.  We 
assumed  that  the  newspapers  and  the  radio  make  consistent 
news  judgments  when  we  decided  to  attribute  to  the  radio 
broadcasts  those  messages  which  occurred  on  the  front  page 


Clinton  R.  Bush  in  a  research  note  "Content  and 
'Wise  en  Valeur ' :  Attention  as  Effect,4'  Journalism 
Quarterly.  37:3,  (I960),  p„  435-6,  presents  a  formula 
for  weighting  journalistic  text  depending  on  elements  of 
format  and  typography  developed  by  a  French  professor, 
Jacques  Kayser.  Bush  opines  however,  that  "...the  words 
in  the  headline  are  probably  more  important  than  the  dis¬ 
play." 


of  the  newspapers.  At  this  point,  we  go  one  step  further  and 
assume  that  those  front  page  stories  which  had  the  highest 
probabilities  of  exposure  in  the  newspaper  will  also  be  given 
the  kind  of  emphasis  that  will  result  in  their  having  the 
highest  probabilities  of  exposures  in  the  radio  broadcasts. 
However,  it  is  difficult  to  imagine  that  the  probabilities  of 
exposure  are  equal  for  stories  in  the  two  media.  It  seems 
reasonable  to  assume  that,  even  given  that  the  person  is  ex** 
posed  to  the  newspaper  as  a  vehicle,  he  has  a  great  amount  of 
choice  in  the  actual  story  to  which  he  attends.  Therefore, 
we  expect  quite  a  difference  between  the  number  of  people  ex¬ 
posed  to  the  vehicle  and  the  number  of  people  exposed  to  any 
given  message  in  the  vehicle,  i.e.,  we  assume  that  the  message 
exposure  probabilities  for  a  given  newspaper  message  will  be 
rather  low  and  indeed  we  have  found  them  to  have  means  of  21 
and  13  percent  for  men  and  women  respectively.  However,  we 
assume  that  the  possibility  for  choice  in  attention  to  the 
various  radio  messages  is  not  so  strong  as  with  newspapers, 
given  that  the  person  is  exposed  at  all  to  the  broadcast.  It 
will  be  recalled  that  the  definition  of  a  member  of  the  audi¬ 
ence  of  a  radio  vehicle  is  someone  who  could  answer  certain 
questions  about  the  time,  station,  and  content  of  the  vehicle. 
The  probabilities  thus  generated  refer  not  to  the  instantaneous 
listenership  of  the  vehicle,  but  the  average  listenership ,  i.e., 
the  probabilities  represent  people  who  have  listened  to  the 
entire  program  or  have  at  least  tuned  in  for  the  duration  of 
the  program.  Also  we  have  estimated  (p.  121)  that  international 
news  occupied  about  three  minutes  of  a  fifteen-minute  newscast. 


Thus,  the  probability  that  a  listener  is  exposed  to  any 
given  message  in  the  vehicle  is  likely  to  be  higher  than 
the  probability  of  a  newspaper  reader  being  exposed  to 
a  given  message .  There  are  some  data  which  shows  that 
these  probabilities  cure  indeed  somewhat  higher.  Studies 
of  housewife  viewers  of  television  programs  have  shown 
that  about  25  percent  of  these  viewers  can  recall  the 
commercial  of  the  program  24  hours  later.  Contrast  this 
to  the  probability  of  exposure  to  a  given  message  in 

O 

the  newspaper  which  for  women  was  13  percent.  Of 
course,  some  of  this  difference  may  be  accounted  for 
by  the  fact  that  these  commercials  are  the  kind  of  inform¬ 
ation  that  are  aimed  at  women  and  presumably  women  have 
a  higher  likelihood  of  attending  to  these  sorts  of 
commercials  rather  than  the  average  story  in  a  newspaper. 
Nevertheless,  it  offers  some  support  for  the  fact  that  the 
24  hour  recall  of  stories  in  electronic  vehicles  by  a  mem¬ 
ber  oi  the  average  audience  is  somewhat  larger  than  the 
24  hour  recall  of  the  particular  message  in  a  newspaper . 
Therefore,  in  calculating  the  probabilities  of  exposure 
to  the  radio  messages,  we  have  made  a  systematic  increase 
in  the  probabilities  generated  for  the  newspaper  occurrences. 

O 

These  data  sure  taken  from  studies  by  Foote,  Cone, 
and  Belding  in  Queens  and  St.  Louis,  which  found  twenty- 
four  hour  recall  of  26  percent  and  23  percent,  and  Look 
Studies  of  Advertising  Communication  which  found  24  per¬ 
cent  rt  .il  after  24  hours.  The  studies  cure  cited  in  a 
volume  called  Media  Balance,  Life  Incorporated,  1965,  p.  7. 


Obviously,  the  probability  1.0  should  not  be  increased  and 
probabilities  near  this  value  should  be  only  slightly  in¬ 
creased,  but  in  such  a  way  so  as  never  to  exceed  the  value 
1.0.  We  have  also  decided  that  the  value  0.0  should  remain 
the  same  and  that  the  maximum  increases  should  occur  for 
probabilities  of  0.5.  With  P*  representing  the  transformed 
values,  the  following  parabolic  function  effectively  accounts 
for  the  ceiling  an*3  floor  effects: 

P*  =  -0.8P2  +  1.8  P. 

Probabilities  equal  to  zero  remain  at  zero.  Probabilities 
equal  to  1.0  remain  at  that  value,  but  probabilities  inter¬ 
mediate  between  those  are  increased  with  the  maximum  increase 
happening  to  a  probability  of  .5  which  is  increased  to  .72. 
The  table  below  shows  the  increases  in  probabilities  for 
various  values  of  the  newspaper  estimated  probabilities. 

If  we  assume  that  most  of  the  newspaper  probabilities  were 
on  the  order  of  .1,  then  the  transformation  causes  an  in¬ 
crease  in  probabilities  of  about  70  percent.  However, 
over  the  entire  range  the  increase  is  13  percent. 


Original 

(Newspaper) 

Probabilities 


Transformed 

(Radio) 

Probabilities 


0.0 

0.000 

0.1 

0.172 

0.3 

0.468 

0.5 

0.720 

0.7 

0.868 

0.9 

0.972 

1.0 

1.000 

Some  Observations  About  the  Message  Exposure  Estimates 

We  note  here  that  the  definition  and  data  used  in 
generating  both  the  newspaper  and  j  adio  probabilities  refer 
to  the  chance  that  a  member  of  th:,  audience  of  a  vehicle 
will  be  exposed  to  and  recall  a  message  in  the  vehicle  24 
hours  after  the  appearance  of  the  vehicle.  Therefore,  the 
exposures  which  are  output  from  the  simulation  are  really 
estimates  of  (aided)  recall  24  hours  after  the  appearance 
of  the  vehicle.  When  we  cumulate  these  sorts  of  exposures 
over  a  significant  length  of  time  such  as  six  months,  we 
are  obviously  making  some  distortion  in  the  kind  of  knowl¬ 
edge  which  is  imparted  by  exposure  to  these  messages.  It 
seems  quite  likely  that  there  is  some  decline  over  a  six 
month.  period  in  recall  of  the  message  from  the  proportion 
recalling  at  the  end  of  24  hours.  However,  on  the  other 
land  we  are  not  in  general  trying  to  ascertain  whether  a 
person  has  been  exposed  to  a  particular  message  but  rather 
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some  general  effects  of  exposure  to  messages.  This  is 
the  kind  of  thing  that  is  measured  by  the  questions  asked 
in  the  NORC  survey.  One  might  imagine  that  these  general 
and  perhaps  unconscious  or  unrealized  effects  diminish  with 
time  or  perhaps  even  increase  (i.e. ,  "sleeper  effect")  in 
time  much  less  rapidly  than  recall  of  exposure  to  a  given 
message.  We  suggest,  therefore,  that  we  may  hope  to  find 
a  correlation  between  the  kinds  of  people  frequently  ex¬ 
posed  to  a  certain  theme  and  questions  of  information  or 
attitude  at  the  end  of  a  six  month-  period  which  are  re¬ 
lated  to  that  theme. 

Finally,  it  should  be  noted  that  the  simulation 
takes  these  probabilities  for  message  exposure  for  men 
and  women  and  uses  them  in  a  nonobvious  way  in  the  simu¬ 
lation.  In  the  actual  simulation  we  have  made  assumptions 
about  the  relative  probabilities  of  exposure  to  the  message 
given  exposure  to  the  vehicle  for  breakdowns  of  the  audience 
other  than  simple  sex  breakdowns.  Thus,  from  our  two 
probabilities  we  make  other  assumptions  which  generate  a 
set  of  36  probabilities  which  distinguish  between  36 
audience  types  (each  population  dimension  excepting  inter¬ 
est  is  included)  but  which  nevertheless  preserve  the  proba¬ 
bilities  for  men  and  for  women  for  each  cf  these  messages 
that  we  have  calculated  up  to  this  point. 

There  is  one  obvious  failing  in  this  procedure, 
however,  in  that  these  mean  message  exposure  probabilities 
for  a  particular  audience  type  are  applied  to  each  person 


in  the  audience  type.  This  is  just  the  kind  of  situation 
which  we  went  to  some  lengths  to  avoid  in  the  first  pass 
of  the  simulation  where  we  use  cumulation  data  to  distrib¬ 
ute  individual  probabilities  about  the  audience  type  mean 
in  order  to  produce  the  cumulative  patterns  of  exposure 
which  are  known  to  exist.  In  order  to  do  this  at  this 
stage  of  the  simulation  would  require  a  tremendous  amount 
of  additional  programming,  computer  space,  and  time,  and 
for  this  reason  this  has  not  been  done.  We  have,  as  a 
matter  of  fact,  no  data  which  would  estimate  anything 
like  cumulation  in  this  case  and  so  we  cannot  go  any 
further,  but  we  note  that  this  is  obviously  a  simplify¬ 
ing  assumption  which  is  not  strictly  within  the  philosophy 

g 

of  the  previous  links  of  the  simulation. 


^  one  possibility  for  generating  probabilities  different 
from  the  mean  for  each  group  would  be  to  take  the  mean  plus 
another  figure  drawn  out  of  the  air  which  would  produce  a 
beta  function  distribution  of  the  probabilities  for  the 
group  about  the  mean.  This  other  figure  drawn  out  of  the 
air  would  be  chosen  in  such  a  way  as  to  make  this  beta  func¬ 
tion  a  bell-shaped  curve  about  the  group  mean.  Only  the 
parameters  for  this  beta  function  would  need  to  be  stored  in 
the  computer  and  as  a  member  of  an  audience  type  was  processed, 
a  probability  would  be  drawn  at  random  from  the  appropriate 
beta  function.  Without  knowing  the  cumulation,  we  could  not 
produce  it  in  the  choice  of  the  beta  function  but  the  pro¬ 
cedure  would  doubtless  generate  a  more  realistic  set  of 
probabilities  than  the  procedure  of  simply  using  the  mean 
probability  for  every  member  of  the  audience  type. 
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The  Simulation  Model  of  Message  Exposure 

Consider  what  might  be  a  reasonably  complete 
description  of  message  exposure  probabilities  for  the 
simulation.  How  shall  we  describe  the  message  exposure 
probabilities  for  a  single  message  in  the  scenario?  We 
might,  on  the  basis  of  the  theme  of  the  message  and  the 
format  factors,  generate  a  set  of  probabilities  of  expo¬ 
sure  to  the  particular  message.  These  probabilities  might 
be  as  detailed  as  mean  probabilities  for  every  audience 
type  in  the  simulation.  This  would  mean  that  we  could 
have  up  to  five  hundred  mean  probabilities  of  exposure 
associated  with  each  message  in  the  scenario.  Thus, 
the  probabilities  of  exposure  associated  with  any  one 
message  may  be  functions  of  the  content  of  the  message, 
the  formating  of  the  message,  the  vehicle  in  which  the 
message  is  carried,  and  finally  the  audience  type  for 
which  the  projection  is  to  be  made.  Clearly,  descrip¬ 
tion  in  this  much  detail  for  each  message  event  in  the 
scenario  is  quite  beyond  storage  or  time  limitations  of 
the  simulation.  Therefore,  it  was  necessary  to  devise  a 
simplified  model,  throwing  out  some  of  the  kinds  of  inter 
actions  which  might  be  possible,  in  order  to  arrive  at 
some  parsimonious  description  of  the  message  exposure 
probabilities . 

Moreover,  in  designing  the  model  we  wanted  to 
take  account  of  the  fact  that,  in  general,  the  researcher 
will  not  have  detailed  information  about  the  message 
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exposure  probabilities  for  each  message,  but  will  know  only 
an  “average"  conditional  probability  of  exposure  to  a  theme 
contained  in  a  vehicle.  (One  often  encounters  statements 
like  “About  twenty  percent  of  newspaper  readers  see  the 
average  foreign  affairs  story.")  Also,  the  researcher  will 
not  in  general  know  the  absolute  value  of  the  message  ex¬ 
posure  probabilities  for  various  groups,  but  will  be  able 
to  make  statements  like,  “Given  readership  at  all  in  the 
vehicle,  men  are  one-third  more  likely  than  women  to  read 
international  affairs  news."  The  following  simplified 
model  assumes  only  this  minimal  knowledge:  we  consider 
a  group  of  vehicles  and  audience  types  for  which  there 
is  some  average  proportion  of  the  vehicle  audience  who  are 
generally  exposed  to  a  given  theme.  This  is  an  average 
audience  of  the  thane,  "averaged"  over  all  possible  relevant 
messages  and  formats.  For  this  theme  and  group  of  vehicles 
we  define,  for  any  dimension  describing  the  vehicle  audience, 
ratios  between  the  probabilities  of  exposure  to  the  message, 
given  exposure  to  the  vehicle,  for  any  two  adjacent  levels 
of  the  dimension. 

We  assume  that  these  ratios  are  constant  for  all 
messages  in  this  theme  appearing  in  this  group  of  vehicles. 
For  example,  the  ratio  of  the  message  exposure  probability 
for  women  to  that  for  men  is  assumed  a  constant  over  all 
messages  and  all  vehicles  in  the  vehicle  group.  (We  have 
not  assumed  that  the  probabilities  of  exposure  for  men  or 


women  are  constant  for  all  vehicles  in  a  vehicle  group 


because  the  vehicle  audience  composition  for  the  various 
vehicles  may  differ  to  quite  a  degree.  If  a  constant 
proportion  of  that  vehicle  audience  sees  the  theme  gener¬ 
ally,  we  cannot  then  assume  that  the  proportion  of  men  and 
proportion  of  women  are  also  constant,  since  the  composi¬ 
tion  of  the  vehicle  audience  (the  number  of  men  and 
women)  changes  from  vehicle  to  vehicle. ^ 

These  assumptions  lead  to  the  following  equations: 
let  (Vj^)  and  (Vw^)  represent  the  number  of  males  and  fe¬ 
males,  respectively,  in  the  ith  vehicle  audience.  These  . 
figures  (or  the  breakdown  by  any  other  population  dimen¬ 
sion)  are  known  from  the  previous  links  of  the  simulation. 
Let  (A)  be  the  proportion  of  the  vehicle  audience  who 
generally  sees  the  theme.  This  is  a  known  “average”  over 
all  possible  formats,  something  intrinsic  about  the  theme. 
Let  (Pjh)  and  (Pwi)  represent  the  conditional  probabili¬ 
ties  of  exposure  for  men  and  women,  respectively,  to  the 
them  in  the  ith  vehicle  and  (R  =  /Pw^ )  be  the  (known) 
ratio  of  these  two  probabilities,  (at  present)  assumed 
constant  over  all  vehicles.  Then 


Actually,  it  would  be  a  relatively  simple  pro¬ 
cedure  to  allow  different  ratios  for  different  vehicles. 
As  presently  programmed,  the  simulation  does  occasionally 
call  for  changes  in  the  ratios  when  they  imply  probabili¬ 
ties  greater  than  1.0. 


PMi*VMi  =  Number  of  men  exposed  to  the  theme  in  the 
ith  vehicle 

PWi * Vwi  =  Number  of  women  exposed  to  the  theme  in  the 
ith  vehicle 

A' <VMi  +  VWi>  =  Total  number  exposed  to  the  theme  in 
the  i**  vehicle  and  therefore 


PMi‘VMi  +  PWi'VWi  “  »•  <VMi  +  VWi> 


(VII-l) 


with 


R  -  p«i  /pwi' 


(VII-2) 


For  each  vehicle,  the  two  equations  above  are 
solved  for  the  two  unknowns,  P^  and  Pwi,  and  the  theme 
audiences  by  sex  (Pj^  'V  and  Pwi-Vwi)  can  then  be  cal¬ 
culated.  For  dimensions  which  have  several  levels  or 
categories,  there  will  be  correspondingly  more  ratios  to 
specify,  but  the  procedure  and  solution  to  the  equations 
is  straightforward. 

For  any  vehicle  we  have,  at  this  point,  a  set  of 
consisted:  message  audiences  for  each  level  of  each  dimen¬ 
sion  for  which  ratios  have  been  specified.  Given  these 
marginal  values,  it  is  quite  easy  to  generate  message 
audiences  for  each  of  the  cells  in  the  table  implied  by 
the  combination  of  dimensions.  In  doing  this,  we  have 
two  choices,  however.  If  we  simply  take  cross-products 
of  these  values,  we  arrive  at  a  set  of  consistent  cell 
audiences,  but  audiences  which  imply  no  interaction 
among  the  dimensions.  This,  in  fact,  is  the  procedure 
we  have  used  in  the  simulation. 
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However,  there  is  another  way  to  generate  these 
cell  audiences.  We  might  have  initialized  the  cells 
with  the  values  of  the  vehicle  audience  and  performed 
the  smoothing  iteration*"*-  using  these  initial  values. 

The  result  of  this  operation  is  no  longer  the  production 
of  minimum  information  or  noninteraction  cell  values, 
but  the  generation  of  cell  values  which  reflect  the 
interaction  present  in  the  weighting  with  which  the 
iteration  was  initialized. 

There  are  good  grounds  for  considering  the 
initialization  of  the  iteration  with  the  vehicle  audience 
cell  values.  In  the  next  step,  after  we  have  generated 
the  message  audience  values  for  the  cells,  we  divide  each 
of  these  values  by  the  vehicle  audience  for  the  corres¬ 
ponding  cel1  in  order  to  get  a  cell  mean,  which  we  inter¬ 
pret  as  the  conditional  probability  of  exposure  to  the 
message,  given  exposure  to  the  vehicle,  for  a  member  of 
the  ceil  described  by  the  total  of  dimensions  for  which 
ratios  were  specified.  In  general,  it  is  possible  for 
this  procedure  to  px>duce  message  audiences  for  a  cell 
larger  than  the  cell  vehicle  audience.  This  happens 
occasionally  and  when  it  does,  we  reduce  the  probability 
for  that  cell  to  .98  and  report  to  the  researcher  the  net 
effect  in  decreasing  the  audience  of  that  message. 


11 


This  was  discussed  in  Chapter  II. 
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However,  initializing  the  cell  values  with  the  vehicle 
audiences  would  bias  the  message  audiences  in  the  cells 
in  the  same  way  that  the  vehicle  audience  is  biased.  In 
that  way,  although  the  marginals  would  remain  fixed, 
larger  message  audiences  would  occur  in  cells  with 
larger  vehicle  audiences  and  the  possibility  of  getting 
message  exposure  probabilities  greater  than  1.0,  would 
be  diminished.  It  also  would  imply  a  leveling  of  the 
probabilities  of  exposure  to  the  message  toward  some 
average  value,  however.  At  any  rate,  we  have  followed 
the  first  procedure  rather  than  the  second. 

There  is  one  last  factor  which  must  be  added  to 
the  iiodel  in  order  to  account  for  different  audience 
levels  for  different  messages  even  though  they  occur  in 
the  same  vehicles  carrying  the  same  theme.  We  call  this 
a  "format  factor"  (Fj  is  the  format  factor  for  the 
message)  and  it  is  a  positive  number  Which  multiplies 
the  average  message  exposure  probabilities  derived  above, 
to  give  the  j*"*1  message  exposure  probability.  Thus  the 
probabilities  derived  above  for  the  "average”  message 
format  of  the  thane  are  probabilities  for  a  message 
with  a  format  factor  equal  to  1.0.  Formats  which  in¬ 
crease  the  likelihood  of  exposure  to  the  message  have 
format  factors  greater  than  1.0;  formats  which  decrease 
the  likelihood  of  exposure  have  format  factors  less  than 
1.0.  Thus,  in  the  simulation  model,  in  order  to  specify 
a  scenario  of  messages  for  a  theme,  one  must  specify  the 
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general  average  audience  for  the  theme  and  a  set  of 
messages,  each  consisting  of  a  vehicle  number  and  a  for¬ 
mat  factor  which  multiplicatively  increases  or  decreases 
the  audience  of  that  message  with  respect  to  the  average 
theme  audience.  In  addition,  for  each  vehicle  group 
which  the  researcher  wishes  to  include,  he  may  specify 

ratios  of  conditional  exposure  probabilities  along  any 

12 

number  of  dimensions  of  the  population.  Thu3,  this  is 
a  much  simpler  body  of  data  for  the  computer  tc  store  and 
handle,  and  for  the  researcher  to  produce  than  the  most 
general  model. 

The  model  outlined  above  is  convenient  for  the 
average  researcher  who  does  not  have  a  great  deal  of 
data  about  individual  messages.  However,  for  the  present 
simulation,  this  model  implies  that  we  must  make  at  least 


The  product  of  the  largest  format  factor  and  the 
average  proportion  of  the  vehicle  audience  exposed  to  the 
thane  (F-ijnax’ gives  the  largest  proportion  of  the  vehicle 
audience J exposed  to  the  theme.  This  value  imposes  some 
constraints  on  the  possible  values  of  the  ratios  (R).  As 
an  extreme  example  if  the  maximum  message  audience  is  equal 
to  the  vehicle  audiences,  then  esch  conditional  probability 
of  exposure  must  equal  1.0;  therefore  all  ratios  must  equal 
1.0.  Obviously,  if  subgroups  for  which  ratios  are  specified 
are  quits  unequal  in  size  with  the  ratios  implying  that  the 
smallest  subgroup  have  the  largest  conditional  probability, 
then  probabilities  greater  than  1.0  may  be  generated  (and 
presented  quesfioningly  to  the  researcher)  by  the  simulation 
if  the  maximum  message  audience  is  a  sizeable  proportion  of 
the  vehicle  audience.  (There  is  an  option  in  the  simulation 
to  increase  and  /or  decrease  vehicle,  message,  and  'or  net 
probabilities  according  to  the  time  period  or  previous  ex¬ 
posure,  or  some  combination,  in  order  to  simulate  crisis 
situations  during  which  the  normal  probabilities  would  not 
be  expected  to  hold.  The  transformation  is  almost  the  same 
parabolic  transformation  with  ceiling  and  floor  effects  de¬ 
scribed  above,  p.  255. ) 
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one  extra  step  in  preparation  of  the  data  for  the  simula¬ 
tion.  In  the  discussion  of  the  message  exposure  probabili¬ 
ties  generated  in  the  scenario,  we  have  described  the 
generation  of  conditional  probabilities  of  exposure  for 
males  and  females  for  each  of  the  messages  in  each  of 
the  themes.  It  would  be  nice  if  we  could  use  these 
probabilities  directly  in  the  simulation;  however,  given 
the  model,  this  is  not  possible,  and  we  must  put  these 
messages  and  their  exposure  probabilities  into  the  frame¬ 
work  of  the  simulation  model. 

Recall  that  we  have  for  each  message  in  the  theme 
a  message  exposure  probability  for  both  men  and  women. 

The  model  implies  that  for  each  message  in  the  theme  and 
vehicle  group  there  is  a  constant  average  probability  for 
men  and  women  and  that  the  message  audience  variation  from 
message  to  message  is  due  to  a  format  factor  which  multi¬ 
plies  each  of  these  constant  probabilities  to  give  the 
particular  probabilities  for  the  message.  Therefore,  for 
each  theme,  the  ratio  of  our  probabilities,  the  message 
exposure  probability  for  men  and  the  message  exposure 
probability  for  women,  should  be  constant  over  all  messages. 
This  is  not  strictly  true  for  our  messages,  but,  in  general, 
the  exposure  probability  for  men  is  greater  than  the  expo¬ 
sure  probability  for  women  and  the  ratio  is  nearly  constant. 

We  make  the  assumption  that  this  ratio  is  constant 
enough  for  us  to  proceed  with  the  model.  In  this  case,  we 
wish  to  describe  all  the  messages  in  a  theme  and  vehicle 


271 


group,  each  with  its  two  exposure  probabilities,  in  terms 
of  three  kinds  of  numbers,  i.e.,  the  message  exposure 
probability  for  men  (Pjji)#  the  message  esqposure  probability 
for  women  (P  . ),  and  a  format  factor  for  the  message  (F.). 

W1  J 

(To  simplify  the  data  processing,  we  have  grouped  all  the 
vehicles  into  one  group;  thus  we  may  drop  the  subscript 
i  on  the  and  Pw^. )  Let  P^,  PWj  represent  the  known 
(from  the  linear  regression)  conditional  exposure  probabili¬ 
ties  for  the  j message  and  P  ' ,  P„. '  represent  the  prob- 

M]  "3 

abilities  which  will  be  generated  from  the  proper  choice 
of  PM,  Pw,  and  F_.  where  E^,  P^  will  be  the  average  condi¬ 
tional  exposure  probabilities  for  the  vehicle  group,  and 

i.1. 

Fj  the  format  factor  for  the  j  message.  Thus 

P  . '  »  P  -F .  (VII— 3) 

Mj  M3 

and 


P  *F. 
W  3 


(VI 1-4) 


We  want  to  choose  values  of  PM,  Pw,  and  the  that 

minimize  the  difference  between  P„ .  and  P  . ' ,  and  between 

M3  M3 

P„ .  and  F  ' .  Thus  the  problem  becomes 
W3  Wj 

minimize  l [ (PMj-pMj ’ ) 2  +  (pWj“pWj')2] 

with  respect  to  PM»  Pw,  and  Fj.  Replacing  the  PM j ' ,  PWj’ 
with  equations  (VII-3)  and  (VII-4)  in  the  function  to  be 
minimized  and  taking  partial  derivatives  results  in  the 
following  equations 


2  (F. *P„-P„.)  =  0  (sum  over  all  messages) 
.  3  M  M3 
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£  (F.*PU-PM.)  =  0  (stun  over  all  messages) 
j  3  "  WJ 

Pj  -  <VPMj  +  Vwj  )/(PM2  +  Pw2)  (for  each  message). 

We  have  solved  these  equations  by  choosing  values 
of  PM  and  Pw#  calculating  the  resulting  Fj,  testing  the 
tw  sums,  adjusting  the  values  of  PM,  Pw,  etc.  This  has 
been  done  for  each  of  the  themes  in  the  scenario,  in  each 
case  considering  ail  the  vehicles  as  one  vehicle  group. 

The  outputs  are  the  constant  probabilities,  i.e.,  the 
message  exposure  probability  for  men  and  the  message 
exposure  probability  for  women,  and  a  format  factor 
assigned  to  each  of  the  messages  in  the  scenario,  such 
that  the  product  of  the  format  factor  and  the  constant 
probabilities  over  all  the  messages  comes  as  close  as 
possible  to  reproducing  the  values  generated  from  the 
regression. 

The  choice  of  the  average  audience  (A)  for  each 
these  is  the  last  remaining  problem  in  transforming  the 
data  to  fit  the  computer  model.  It  will  be  recalled  that 
this  average  audience  is  the  proportion  of  the  vehicle 
audience  who  see  the  theme  on  the  average,  or  the  audience 
for  a  message  with  a  format  factor  of  1.0.  In  this  case, 
the  message  exposure  probabilities  for  men  and  women  will 
be  just  the  average  values  generated  by  the  procedure 
above.  If  we  let  VM£,  Vw^  represent  the  number  of  men 


and  women  in  the  vehicle  audience  of  the  itl1  vehicle,  then 
this  proportion  A  can  be  written  in  the  following  manner: 


P  . *V  .  +  P  . *V  . 
Mi  Mi  Wi  Wi 


+  V 
Wi 


(This  is  just  a  rearrangement  of  equation  (VII-1)  above.) 

Now  in  general,  the  number  of  women  and  the  number  of 
men  in  the  vehicle  audience  are  different  from  vehicle  to 
vehicle;  however,  it  is  generally  true  that  the  number  of 
women  in  the  audience  of  any  vehicle  exceeds  the  number  of 
men  and  that  the  ratio  (vwi ^Mi >  l*-es  between  the  values 
of  1.0  and  3.0.  Therefore,  for  each  theme  with  its  associ¬ 
ated  average  values  PM  and  Pw,  we  have  calculated  the  value 
of  A,  the  proportional  average  audience,  for  values  of 
vWi/vji-L  =  1.  2,  and  3.  These  are  presented  in  the  table 
below.  We  see  that  the  value  of  A  is  relatively  constant 
as  the  ratio  varies  from  1  to  3.  For  example,  for  Thane 
1,  the  value  for  A  varies  only  from  .2000  to  .1700.  In 
general,  the  change  in  A  within  a  theme  for  various 
ratios  of  V^/V^  is  not  nearly  so  large  as  the  change 
in  A  for  the  various  themes.  In  order  to  simplify  matters 
then,  we  have  used  the  value  of  A  at  V__.  /VM.j  =  2.0  for 
each  theme.  This  is  the  average  value  for  the  proportion 
of  the  vehicle  audience  who  see  the  theme.  Thus,  for  each 
theme  we  have  produced  a  ratio  of  P){  to  P^r,  a  value  of  A, 


and  a  format  factor  for  each  message.  This  has  probably 
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Computations  were  not  performed  for  themes  7,  8,  9,  and  10,  since  there  themes 
not  used  in  the  scenario. 


caused  some  distortion  in  the  data;  however,  this  dis¬ 
tortion  does  not  seem  to  he  of  a  magnitude  to  cause 


great  concern,  given  the  quality  of  the  data  before 

13 

the  model  was  imposed  upon  it.  In  the  simulation  in 
addition  to  the  ratio  value  for  sex,  i.e.,  the  specifi¬ 
cation  of  message  exposure  probabilities  for  men  and 
women,  we  have  included  ratios  for  age,  education,  and 
socioeconomic  status  (SES ) .  These  ratios  are  derived 
from  data  reported  by  Schramm  and  White  from  a  1949 
study  of  a  random  sample  of  746  readers  in  an  Illinois 

city  of  a  population  of  100,00  and  for  a  local  evening 

14 

paper  of  approximately  65,000  circulation.  All  the 
respondents  in  this  study  were  readers  of  the  news¬ 
papers  . 

Table  IX  in  the  study  reports  the  percentage  of 
public-affairs  news  read,  broken  down  by  sex  for  education. 


Actually,  a  more  serious  distortion  may  arise  from 
an  error  in  the  writing  of  the  scenarios  for  the  simulation 
which  occurred  at  this  point.  Inadvertently,  the  ratio  of 
P^/Pw  for  all  the  themes  was  set  equal  to  the  value  of  the 
first  theme.  This  value,  as  seen  from  the  table,  is  1.810 
and  is  in  general  not  too  deviant  from  the  values  for  most 
of  the  themes.  However,  for  Themes  11,  12,  and  15,  this 
value  may  cause  a  significant  error  in  the  distribution  of 
exposures  between  men  and  women.  At  any  rate,  the  magni¬ 
tude  of  the  error  attributable  to  this  oversight  in  writing 
the  scenario  is  not  evident. 

^-4  The  data  reported  in  Wilbur  Schramm  and  David  M. 
White,  "Age,  Education,  and  Economic  Status  as  Factors  in 
Newspaper  Reading"  Journalism  Quarterly.  (June,  1949), 
also  reprinted  in  Wilbur  Schramm  (ed.).  Mass  Communications 
(Urbana:  University  of  Illinois  Press,  1949).  About  half 
the  circulation  of  this  evening  paper  was  in  the  city  itself, 
and  the  sample  of  746  readers  was  obtained  by  selecting  home 
addresses  throughout  the  city  on  a  random  basis. 
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Table  VII-6 .  Ratios  of  Percentages  of  Public  Affairs  News 

Read  for  Several  Population  Subgroups. 


Dimension 

Ratio 

Ratio  of  Percentage  for  Men  to 
Percentage  for  Women  Within 
the  Dimension  Level 

Dimension 

Ratio  (PM/PW! 

Education 

Education 

Colleqe 

1.176 

Grade  School 

1.696 

High  School 

High  School 

1.480 

Hiqh  School 

1.285 

Grade  School 

College 

1.153 

Age 

Age 

10  -  19 

.819 

20  -  39 

.824 

20  -  29 

1.328 

40  - 

30  -  39 

1.202 

40  -  49 

1.133 

50  -  59 

1.396 

60  - 

1.377 

SES 

SES 

Highest 

1.303 

Highest 

1.283 

Middle 

2 

1.243 

.Middle 

1.295 

Lowest 

3 

1.508 

Lowest 

1.356 

Source:  See  text  above 
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age,  and  economic  status  groups.  We  note  that  these 
percentages  of  exposure  for  men  and  women  within  each 
of  the  classes  of  education,  age,  and  economic  status 
are  summaries  over  a  large  number  of  news  stories, 
each  with  its  own  format  and  themes  in  terms  of  our 
simulation  model.  Making  the  assumption  that  the  effects 
of  formating  somehow  cancel  out,  or  that  the  net  format 
factor  in  this  case  is  1.0,  and  that  the  average  audience, 
as  a  proportion  of  the  vehicle  audience,  is  approximately 
equal  to  the  figures  for  our  study  (or  that  the  ratios 
hold,  even  if  the  average  audiences  are  not  the  same), 
we  have  calculated  ratios  of  conditional  exposure 
probabilities  to  this  news  from  the  table  and  from  the 
sizes  of  each  of  the  categories.  In  Table  VII-6  we 
present  these  ratios  for  sex  by  each  of  the  other  three 
dimensions  and  for  the  dimensions  of  education,  age, 
and  economic  status. 

*7e  see  that,  according  to  this  study,  our  estimate 
of  the  ratio  of  sex  probabilities  is  high,  that  in  no 
category  of  this  study  does  it  reach  1.801,  but  that  it 
ranges  from  a  low  value  of  .819  in  the  case  of  the  age 
group  10-19,  to  1.696  for  grade  school -educated  individu¬ 
als.  Omitting  the  10-19  age  group  (which  is  not  included 
in  the  simulation  population),  the  average  ratio  for  sex 
is  about  1,30,  which  is  to  be  compared  to  the  ratios 
ranging  from  1.36  to  1.93  for  the  themes  presented  in 
Table  VII -5. 
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The  ratios  for  age,  education,  and  socioeconomic 
status  as  presented  in  the  table  were  used  for  the  simula¬ 
tion  values  for  each  of  the  themes  in  the  scenario.  (This 
da -a,  incidentally,  is  corroborated,  according  to  Schramm, 
by  data  collected  by  the  University  of  Minnesota  survey 
on  the  Minneapolis  Star  Tribune  and  by  Eugene  Liner  in  an 
Illinois  thesis.)  Thus,  for  better  or  for  worse,  we  have 
used  what  data  seemed  aval 1  able  and  plausible  to  add  addi¬ 
tional  structure  to  the  media  system  in  the  simulation 
model. 

It  would  be  interesting  to  find  out  how  sensitive 
the  simulation  is  to  changes  in  the  values  of  these  parame¬ 
ters?  however,  even  without  any  sensitivity  testing,  we 
believe  that  these  values  realistically  reflect  the  kinds 
of  structure  in  the  real  media  system. 

Anticipating  one  of  the  more  interesting  outcomes 
of  the  simulation,  let  us  recall  that  for  the  radio  ve¬ 
hicles  we  have  specified  only  the  magnitudes  of  the  audi¬ 
ences  and  the  differences  in  their  distribution  between 
males  and  females.  In  specifying  only  these  factors  and 
leaving  it  to  the  parameter  estimation  routine  to  produce 
the  distribution  of  the  audiences  for  each  of  these  radio 
vehicles  across  the  population  types,  we  have  created 
exactly  similar  distributions  of  the  audiences  of  these 
vehicles  across  the  other  dimensions  of  the  population, 
i.e.,  each  of  the  radio  vehicles  will  have  the  same  dis¬ 
tribution  of  men  and  the  same  distribution  of  women  across 
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each  of  the  other  dimensions  of  the  population  even  though 
the  total  number  of  men  and  the  total  number  of  women  may 
differ  from  vehicle  to  vehicle.  In  addition,  in  choosing 
values  of  the  ratios  that  will  be  constant  across  all  themes, 
we  produce  the  same  relative  distribution  of  message  exposure 
probabilities  across  the  population  types  for  each  theme 
except  for  the  variations  introduced  by  the  format  factors 
of  the  messages.  The  average  probability  of  exposure  for 
the  population  to  a  message  in  the  theme  may  vary  from 
theme  to  theme  but  the  relative  probabilities  will  remain 
much  the  same  from  theme  to  theme  across  the  population 
types.  We  shall  see  the  consequences  of  this  fact  later 
in  the  analysis  of  the  distribution  of  exposures  from  the 
themes . 

Triggering  and  Duplication 

When  simulating  a  scenario  of  a  crisis  situation, 
the  researcher  may  often  wish  to  modify  the  vehicle  ex¬ 
posure  probabilities,  the  message  exposure  probabilities, 
the  format  factors,  or  the  products  of  the  three,  i.e., 
the  net  probabilities  of  esqposure  to  the  message,  in 
order  to  account  for  heightened  interest  in  the  theme. 

In  the  simulation  we  allow  the  researcher  either  to  in¬ 
crease  or  decrease  any  of  these  probabilities,  either  as 
a  function  of  the  person's  exposure  to  some  theme  or  as 
a  function  of  the  time  period  (e.g.,  in  order  to  simulate 
increased  attention  due  to  the  word-of-mouth  spread  of 
information) .  It  is  also  possible  to  reverse  the 
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triggering  changes  or  to  impose  cyclical  changes  in  the 
probabilities.  We  have  not  used  any  triggering  of  changes 
in  exposure  probabilities  in  the  present  simulation,  since 
there  was  no  obvious  crisis  situation  relating  to  the 
themes  during  the  period  of  the  simulation;  therefore 
we  will  not  go  into  the  complexities  of  the  triggering 
mechanisms  in  great  detail.  However,  it  is  necessary  to 
understand  the  notion  of  exposure  classes  in  order  to 
understand  how  the  duplication  statistics  are  produced 
in  the  simulation. 

The  triggering  of  probability  changes  on  the  basis 
of  previous  exposure  to  some  theme  or  other,  requires 
the  retention  in  the  simulation  for  each  member  of  the 
population  and  every  time  period,  of  some  indication  of 
the  person's  level  of  prior  exposure  to  the  theme.  To 
maintain  the  individual's  exact  number  of  exposures  for 
every  theme  and  time  period  would  require  an  impossible 
amount  of  storage  space  in  the  computer ;  therefore,  we 
have  limited  the  number  of  themes  which  may  act  as  trig¬ 
gering  themes  to  (at  most)  the  first  three  themes  and 
introduced  the  notion  of  exposure  classes,  rather  than 
exact  number  of  exposures.  The  researcher  may  set 
three  levels  of  exposure,  which  define  the  boundaries 
of  four  exposure  classes  (the  lower  boundary  of  the  first 
exposure  class  is  taken  as  zero  exposures  and  the  upper 
boundary  of  the  fourth  exposure  class  as  equal  to  the 
number  of  messages  in  the  scenario)  and  he  may  request 
triggering  based  upon  the  person's  exposure  class. 


This  simplified  handling  of  exposure  classes  allows  the 
use  of  only  two  binary  digits  (bits)  per  person  to  define 
the  exposure  classes  therefore,  for  a  population  of  3,000 
people  and  three  themes,  the  total  storage  required  for 
one  time  period  is  only  504  36-bit  words. 

The  simulation  is  structured  in  such  a  way  that  in 
order  to  produce  duplication  between  themes  it  is  necess¬ 
ary  at  least  to  define  trigger  themes.  This  arises  out 
of  the  following  problem:  in  order  to  maike  the  calcula¬ 
tion  of  the  audience  duplication  between  any  two  themes, 
one  must  know  for  each  individual  at  each  time  period  his 
cumulative  non-exposure  probability.  Then,  one  minus 
this  figure  is  the  probability  of  his  being  exposed  at 
all  to  the  theme;  the  product  of  these  probabilities  for 
any  N  themes  is  the  probability  of  the  individual.' s  ' being 
exposed  at  least  once  to  each  of  the  N  themes.  The  ex¬ 
pected  number  of  people  exposed  at  least  once  to  all  of 
the  themes  considered  in  the  duplication  is  just  the  sum 
of  these  probabilities  for  all  persons  in  the  cell.  In 
theory  this  is  fine,  but  in  practice  it  means  that  one 
must  maintain  the  cumulative  non-exposure  probability  for 
every  individual,  for  every  time  period,  for  ,-ivery  theme, 
and  possibly  by  media  type,  etc.  This  would  require  an 
overwhelming  amount  of  storage  and  is  simply  impossible 
to  handle.  Therefore,  we  have  settled  upon  another, 
cruder  method  for  handling  the  problem.  First  of  all, 
we  have  limited  the  kind  of  duplication  statistics  that 
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can  be  obtained  to  the  duplication  between  any  of  a  set  of 
trigger  themes,  and  any  one  of  the  other  themes.  That  is, 
one  may  ask  for  duplication  between  any  trigger  theme  and 
any  other  theme  (including  another  trigger  theme)  for  any 
time  period.  Since  the  cumulative  non-exposure  probabilities 
are  available  for  any  theme  at  its  running,  the  model  there¬ 
fore  requires  additionally  only  knowledge  of  the  cumulative 
non-exposure  probabilities  of  one  of  the  trier  -  themes 
for  the  particular  time  period. 

Therefore,  we  must  have  some  way  of  getting  the 
cumulative  non-exposure  probabilities  for  every  trigger 
theme  for  each  time  period,  for  as  many  as  three  trigger 
themes  and  any  number  of  time  periods.  We  have  allowed 
the  researcher  to  define  three  exposure  boundaries  which 
define  four  exposure  classes  (class  0,  1,  2,  or  3)  depend¬ 
ing  upon  the  cumulative  expected  number  of  exposures  to 
the  theme  through  the  current  time  period.  The 
cumulative  expected  number  of  exposures  is  just  the  sum 
of  all  the  exposure  probabilities.  By  knowing  the  class 
of  a  person,  we  can  make  an  estimate  of  the  sum  cf  the 
number  of  his  exposure  probabilities  through  the  current 
time  period.  For  instance,  for  a  person  in  the  lowest 
exposure  class,  we  take  the  average  of  zero  exposures 
and  the  number  of  exposures  at  the  first  class  boundary 
as  an  estimate  of  his  cumulative  number  of  exposures 
through  the  time  period.  (Obviously  this  is  a  rough 
estimate  and  it  is  best  fox  the  case  when  the  actual 
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expected  number  of  exposure  events  for  the  person  lies 
midway  between  the  class  boundaries.)  Dividing  this 
estimate  of  the  cumulative  number  of  exposures  of  the 
person  by  the  total  number  of  message  events  through 
the  current  time  period,  gives  the  estimate  of  the 
person's  average  exposure  probability.  We  estimate  the 
cumulative  non-exposure  probability  from  this  average. 

A  first  estimate  assumes  that  the  true  exposure  proba¬ 
bility  for  each  event  was  equal  to  the  average  exposure 
probability.  Each  non-exposure  probability  is  one  minus 
the  exposure  probability  and  the  product  of  all  the  non¬ 
exposure  probabilities  is  the  cumulative  non-exposure 
probability. 

We  can  show  that  this  estimate  of  the  non-exposure 
probability  is  the  highest  possible  value:  if  each  of  the 
exposure  probabilities  were  equal  to  the  average,  then  the 
product  of  K  individual  non-exposure  probabilities  would 
be  a  product  of  K  equal  non-negative  numbers.  If  the 
exposure  probabilities  were  actually  distributed  about 
the  mean,  the  product  would  be  a  product  of  K  non-negative 
numbers,  some  or  all  of  which  are  not  equal  to  the  mean, 
although  the  average  of  these  numbers  must  equal  that 
mean.  The  first  product  is  the  largest  possible.  If  wa 


let  (i  =  1,  2,  .  .  .K)  represent  the  K  probabilities, 

K 

then  we  wish  to  find  the  maximum  of  the  product  ^ 


subject  to  the  constraint  that  the  sum  of  the  probabili- 

K 

ties  equal  a  consta.it  (IP-  =  C).  We  form  the  expression 

•  i- 

i=l 
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K  K 

TT  P.  -  X  (  Z  P.  -  C) 


i=l  -  i=l  * 

where  X  is  a  Lagrangian  multiplier.  Setting  the  partial 
derivatives  equal  to  zero 


3 

5FT 


K  K  I  K 

TT  Pi  -  M  I  Pi  _  C)  =  0  =  TT  P.  “  X,  j=l,2,...K 
lx-l  1=1  J  A 
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W~ 


[K  K  1  K 

ir  P.  -  X(  Z  P.  -  C)1  =  0  =  Z  p. 

i=l  1  i-1  1  J  i-1  1  " 


TT  j?  •  “  X, 

i*j  1 

j=l ,2 , . . .K 

(VII-5) 

K 

V*  -  c 
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We  take  any  two  of  the  K  equations  (VII-5) 
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Dividing  the  first  by  the  second 
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Therefore  the  maximum  value  of  the  produce  occurs  when  all 
of  the  probabilities  are  equal.  The  cumulative  nonexposure 


4  1 


probability  thus  calculated  is  an  upper  bound. 

A  second  calculation  assumes  that  the  exposure 


I 


p  robabilities  distribute  themselves  about  the  average 
exposure  probability  Which  we  have  just  calculated.  We 
generate  a  symmetric  distribution  of  probabilities  about 
the  average  exposure  probability  and  calculate  another 
estimate  of  the  cumulative  non-exposure  probability  from 
this  distribution.  Since  the  distribution  is  widely  dis¬ 
tributed  about  the  mean,  the  individual  probabilities 
will  be  quite  unequal.  Since  the  resulting  estimate  of 
the  cumula  tive  non-exposure  probability  will  be  much 
lower  than  the  previously  calculated  value  and  probably 
lower  than  the  actual  value,  we  take  the  average  of  the 
two  calculated  values  as  the  best  estimate  of  the  non¬ 
exposure  probability.  (We  have  worked  by  hand  several 
problems  which  indicate  that  this  seems  to  give  a  good 
estimate  of  the  non-exposure  probability. ) 

Having  calculated  the  non-exposure  probabilities 
for  one  of  the  Lrigger  themes ,  the  probability  that  a 
person  is  exposed  at  least  once  to  both  the  trigger 
theme  and  any  other  theme  is  simply  the  product  of  his 
cumulative  exposure  probabilities  (one  minus  the  cumula¬ 
tive  non-exposure  probability)  for  each  theme.  The  sum 
of  these  products  over  all  members  of  the  cell  is  the 
expected  number  of  people  in  the  cell  who  have  been 
exposed  at  least  once  to  both  of  the  two  themes. 

The  definition  of  the  exposure  classes  serves  two 
purposes  then:  one  purpose  is  to  allow  triggering  of 
probability  changes  based  on  previous  exposure  to  a  given 


theme.  The  second  allows  the  estination  of  the  non¬ 
exposure  probabilities  in  order  to  calculate  duplica¬ 
tion  between  themes.  In  the  first  case,  the  classes  •» 

would  be  chosen  in  such  a  way  as  to  represent  the  re¬ 
searcher's  feelings  about  the  thresholds  of  change  in 
attention  to  one  theme  because  of  exposure  to  another  * 

theme.  One  might  imagine  the  upper  boundary  of  the 
lowest  exposure  class  being  set  at  about  1.0  expected 
exposures,  if  the  theme  were,  for  instance,  new3  of  the 
assassination  of  Kennedy  and  the  researcher  anticipated 
that  simply  getting  this  message  would  be  sufficient  to 
significantly  heighten  exposure  probabilities  for  many 
other  kinds  of  themes. 

However,  consider  the  case  for  the  estimation  of 
the  non-exposure  probabilities.  The  first  step  is  essen¬ 
tially  an  averaging  procedure,  i.e.,  the  person's  mean 
exposure  probability  is  estimated  as  the  average  of  the 
upper  and  lower  bound  of  his  ejqaosure  class.  The  first 
three  exposure  classess  are  bounded  on  both  ends  by 
either  0.0  (for  the  lowest  class)  or  the  values  of  the 
boundaries  for  each  class  which  were  chosen  by  the  re¬ 
searcher.  However,  the  upper  bound  for  the  highest 
exposure  class  can  be  only  the  number  of  messages  thus 

a 

far  processed.  Therefore,  one  must  use  another  method 

«• 

of  estimating  the  number  of  exposures  for  people  who 
fall  in  this  high  class.  The  estimation  procedure  is 
very  crude;  it  amounts  simply  to  assuming  that  the 
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average  exposure  probability  was  about  .5  for  each  of 
the  exposure  events  for  persons  falling  into  the  high 
class.  This  assumption,  while  very  rough,  probably  is 
not  critical  for  the  calculation  of  the  duplication  be¬ 
tween  themes,  since  the  probability  of  a  person's  being 
exposed  to  the  trigger  theme,  given  that  he  is  in  the 
highest  class  of  exposures,  is  very  high  regardless  of 
the  exact  number  of  exposures . 

For  the  purpose  of  duplication  the  boundaries  of 
the  lowest  exposure  classes  should  form  a  narrow  grid  in 
order  to  distinguish  those  people  who  have  relatively 
little  exposure  and,  therefore,  relatively  low  cumulative 
exposure  probabilities.  When  one  does  the  mathematics, 
one  finds  that  the  possible  range  of  the  cumulative  prob¬ 
ability  of  exposure  to  the  theme  is  very  small  once  the 
person's  cumulative  expected  number  of  exposures  is  at 
least  one.  For  example ,  if  the  person  has  one  expected 
exposure  resulting  from  one  message  event,  the  probability 
of  exposure  to  the  theme  is  identically  1.0,  since  his 
probability  of  exposure  for  that  particular  message  must 
have  been  1.0.  If  the  person  had  one  expected  exposure 
in  five  events  so  that  all  the  exposure  probabilities 
could  have  been  of  the  order  of  .2,  the  minimum  possible 
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probability  of  exposure  to  the  theme  is  .67^ ,  and  even  if 
the  person  had  only  one  expected  exposure  in  one  thousand 
events,  the  minimum  value  for  the  probability  of  exposure 
to  the  theme  is  still  .63.  After  a  person  has  (say)  ten 
expected  exposures,  even  for  1,000  message  events,  his 
cumulative  probability  of  exposure  to  the  theme  is  only 
very,  very  slightly  less  them  1.0.  Therefore,  for  the 
purposes  of  discriminating  between  those  who  are  probably 
exposed  to  the  theme  and  those  not  likely  exposed  to  the 
theme,  any  number  of  cumulative  expected  exposure  events 
greater  than  one  is  almost  a  sure  indication  that  the  per¬ 
son  is  exposed  to  the  theme,  i.e.,  his  cumulative  prob¬ 
ability  of  exposure  to  the  theme  is  very  high.  For  this 
reason,  we  have  set  the  exposure  class  boundaries  at  the 
low  levels,  0.2,  0.6,  and  1.2  expected  exposures. 


Let  Pi  be  the  probability  of  exposure  to  the  i*"*1 
message.  Then  the  non-exposure  probability  is  1-Pi  and 
for  five  messages,  the  probability  of  not  being  exposed 
at  all  is  (1-Px) • (1-Po) • (I-P3) • (I-P4) * (I-P5) .  Obviously, 
if  any  of  the  probabilities  of  exposure  equals  1.0,  this 
cumulative  non-exposure  probability  is  equal  to  zero;  it 
has  a  maximum  value  when  all  the  Pi  are  equally  small. 

Now  the  number  of  expected  exposures  (by  assumption,  1.0) 
is  just  the  sum  of  the  five  Pi;  therefore  the  values  of 
the  Pi  which  maximize  the  cumulative  non-exposure  probability 
are  0.2  (i.e.  five  equally  small  values  which  sum  to  1.0) 
and  tVj  maximum  cumulative  non-exposure  probability  is 
0.32763  (0.83) ~  Thus  the  minimum  cumv.iative  probability 
of  being  exposed  at  least  once  is  0.67232  (1.0  -  0.32768). 
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CHAPTER  VIII 

THE  SIMULATED  EXPOSURES  TO  THE  CINCINNATI 
MASS  MEDIA  SYSTEM 

In  the  previous  chapters,  we  have  specified  the  model 
to  the  Cincinnati  mass  media  system,  creating  the  vehicles 
and  their  audiences,  and  we  have  recorded  the  actual  mes¬ 
sages  in  the  system  over  the  six  months  with  their  message 
exposure  probabilities.  At  this  point,  we  are  ready  to 
run  the  simulation.  In  the  first  part  of  this  chapter 
we  examine  the  consistency  with  which  the  model  has  syn¬ 
thesized  the  input  data,  by  use  of  exposure  statistics 
from  three  simple  trial  scenarios.  We  will  find  that  the 
model  does  reproduce  the  input  data  and  produce  plausible 
exposures  to  these  three  scenarios.  Next,  we  will  look  at 
the  exposure  results  for  the  twelve  themes  in  the  scenario, 
examining  the  growth  and  distribution  of  exposure,  its 
duplication  across  themes,  and  the  relative  importance  of 
the  various  media  vehicles.  Finally,  we  will  attempt  to 
correlate  exposures  to  the  twelve  themes  with  indices  of 
actual  change  from  the  NORC  panel.  We  close  with  several 
comments  about  the  relative  importance  of  the  different 
kinds  of  input  data,  some  suggestions  toward  increasing 
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the  usefulness  of  the  model,  and  an  admonition  or  two  for 
designers  of  simulation  models. 

Validation  of  Several  Parts  of  the  Simulation  Model 

If  the  simulation  model— our  consistency  machine — is 
to  be  of  any  use  in  predicting  exposures  of  populations  to 
messages  in  the  communication  vehicles,  then  it  should  at 
a  minimum  be  consistent  with  the  input  data  which  specify 
the  parameters  of  the  model.  The  simulation  should,  in 
addition  to  synthesizing  and  integrating  the  input  data, 
also  reproduce  or  describe  accurately  the  input  data.  This 
is,  in  Hanna's  terms,  the  descriptive  validity  of  the 
model. ^  In  this  section,  we  will  look  at  how  well  the 
parameter  estimation  routine,  cumulation  routine,  the 
assignment  of  probabilities  to  cells,  and  the  duplication 
routine  reproduce  that  known  data  which  was  input  in  order 
to  specify  the  model  to  a  given  population  and  mass  media 
system.  We  begin  with  the  parameter  estimation  iteration 
in  the  first  link  of  the  simulation. 

The  Mosteller  Parameter  Estimation  Iteration 


Does  the  iteration  process  for  integrating  partial 
data  succeed  in  reproducing  a  population  and  vehicle  audi¬ 
ences  which  duplicate  the  input  data?  The  answer  to  this 

^"Hanna,  Information  -  Theoretic  Techniques 
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question  is,  yes,  the  iteration  process  does  produce  con¬ 
sistent  population  and  vehicle  audiences  within  the  margin 
of  inaccuracy  which  one  finds  in  the  input  data.  Let  us 
be  more  explicit  about  the  inaccuracies  in  the  input  data. 
The  population  tables  which  were  used  to  create  the  simula¬ 
tion  population  consisted  of  all  the  three  dimensional 
tables  possible  from  the  five  population  dimensions. 

There  are  ten  of  these  three-dimensional  tables  and  they 
are  not  totally  consistent  with  each  other  because  of  non¬ 
response  or  not  ascertainable  responses  in  the  survey 
data.  For  example,  the  total  population  in  these  subtables 
ranges  from  1097  respondents  to  1112  respondents.  Thus, 
we  have  a  variation  of  at  most  15  in  the  sum  of  the  fre¬ 
quencies  in  the  subtables.  Since  these  tables  of  about 
1100  respondents  are  used  to  generate  a  population  of 
size  2000,  meaning  that  each  of  the  values  in  the  original 
table  must  be  multiplied  by  a  factor  of  about  1.8,  we  see 
that  ther5  are  significant  inconsistencies  possible  in  the 
cells  of  these  subtables.  The  iteration  routine  will  pro¬ 
duce  a  population  which  has  an  average  error  which  may  be 
anywhere  from  half  as  large  to  just  as  large  as  the  dis¬ 
crepancies  in  the  input  tables.  These  errors  are  then 
multiplied  by  the  factor  of  1.8  in  normalizing  the  popula¬ 
tion  to  the  value  of  2000,  and  then  perhaps  some  slight 
additional  error  is  made  when  the  nonintegral  values  cure 
rounded  to  produce  integral  numbers  of  respondents  in  each 
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cell  of  the  population.  Therefore,  in  evaluating  the  suc¬ 
cess  of  the  iteration  routine,  we  must  remember  that  the 
input  values  themselves  cure  not  consistent,  but  may  have 
subtable  cells  which  are  inconsistent  by  as  much  as  ten 
or  fifteen  respondents  which  under  the  normalization  must 
be  multiplied  by  a  factor  of  1.8.  Therefore,  we  e3q?ect 
imperfect  agreement  but  some  sort  of  average  values  for 
the  population  values  produced  by  the  iteration  routine, 
the  normalizing  to  the  simulation  population,  and  the 
rounding  to  the  cell  values.  In  Table  VIII -1  we  compare 
one  of  the  input  tables  from  the  NORC  survey  data  with  the 
population  values  produced  by  the  simulation.  In  order 
to  facilitate  the  comparison,  the  population  values  from 
the  survey  data  are  normalized  to  a  total  population  of 
2000. 

In  comparing  the  two  tables,  we  see  that  for  some  of 
the  large  cells  the  simulation  values  cure  larger  than  the 
survey  data  by  15  respondents  or  about  15  percent.  Note, 
for  instance,  the  differences  in  the  two  values  for  the 
males  with  high  school  education  and  low-middle  interest. 
In  the  light  of  our  discussion  above,  we  must  not  make 
the  mistake  of  considering  this  difference  between  the 
simulation  and  the  input  values  as  due  totally  to  an  error 
in  the  simulation  population  values,  since  the  input 
tables  are  themselves  inconsistent  and  the  iteration  pro¬ 
duces  a  population  which  attempts  to  reconcile  so  feu:  as 


147.60  152.53  106.69  406.82  61.38  198.00  115.29  374.67 
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possible  the  inconsistencies  in  the  input  data.  At  any 
rate,  a  comparison  of  the  two  tables  seems  to  show  that 
the  iteration,  normalizing,  and  rounding  process  fairly 
well  duplicates  the  distribution  of  the  population  across 
the  cells  as  given  by  the  input  data. 

In  Table  VIII-2  we  make  a  similar  comparison  between 
the  audiences  in  the  population  subgroups  of  the  morning 
Enquirer  as  generated  by  the  NORC  survey  data  and  the  same 
audiences  after  the  iteration  technique  has  synthesized 
the  ten  survey  subtables. 

The  table  of  audiences  indicates  about  the  same  order 
of  error  as  in  the  previous  table  for  the  population  data. 
Again,  we  note  that  the  differences  between  the  values 
generated  by  the  simulation  and  the  values  input  from  the 
survey  data  are  not  entirely  error,  since  the  simulation 
is  attempting  to  reconcile  inconsistencies  in  the  input 
data.  The  parameter  estimation  technique  does  seem  to 
synthesize  the  input  tables  correctly.  Of  course,  what 
it  cannot  do  is  produce  any  unknown  higher  order  inter¬ 
actions  . 

Trial  Scenarios  to  Validate  the  Cumulation  and 
Probability  Assignment  Routines 

To  test  whether  the  simulation  was  correctly  synthe¬ 
sizing  the  input  data  (in  the  sense  of  being  able  to  re¬ 
produce  it,  at  a  minimum)  and  to  judge  the  plausibility 


of  its  output,  we  ran  several  simplified  trial  scenarios 
before  proceeding  to  the  actual  content  of  the  Cincinnati 
ma^s  media  system.  In  order  to  plot  the  penetration  of 
the  media  vehicles  in  a  single  weekday,  the  first  scenario 
divides  the  day  into  thirteen  time  periods  and  uses  a 
single  story  in  each  of  the  vehicles  throughout  the  day. 

We  could  describe  this  scenario  as  a  one-day  average  news 
story.  A  second  trial  scenario  represents  a  three  weekday 
average  story,  dividing  each  day  into  six  time  periods  (for 
a  total  of  eighteen  time  periods  over  the  three  days)  with 
a  single  message  in  each  vehicle  each  day.  The  third  trial 
scenario  represents  a  more  important  three-day  news  story. 
Thir.  scenario  differs  from  the  average  three-day  story  in 
that  each  newspaper  now  carries  three  messages  relating  to 
the  story  at  each  appearance  of  the  newspaper  over  the 
three  days. 

The  trial  scenarios  were  designed  not  only  to  test 
the  effect  of  news  events  of  differing  magnitude  and  dura¬ 
tion,  but  also  to  display  the  effect  of  the  message  ex¬ 
posure  probabilities.  Recall  that  the  probability  dis¬ 
tributions  over  the  population  produced  by  the  first  half 
of  the  simulation,  attempt  to  reproduce  the  vehicle  audi¬ 
ences:  it  is  in  the  second  half  of  the  simulation  with 
the  actual  messages,  their  format  factors,  the  ratios  of 
exposure  probabilities  by  sex,  age,  etc.,  that  we  can  cal¬ 
culate  message  exposure  probabilities  which  are  actually 
conditional  probabilities  cf  exposure  to  a  message  in  a 


vehicle,  given  exposure  to  the  vehicle.  Thus,  by  running 
each  of  the  trial  scenaros  with  message  exposure  proba¬ 
bilities  equal  to  1.0,  we  produce  the  vehicle  audiences 
as  output;  if  we  run  the  trial  scenarios  with  more  realis¬ 
tic  (smaller)  message  exposure  probabilities,  we  produce 
audiences  of  a  series  of  messages  in  these  vehicles. 

For  each  of  the  trial  scenarios  above,  we  have  used  both 
sets  of  message  exposure  probabilities,  producing  in 
effect  six  scenarios,  three  of  which  report  the  vehicle 
audiences  of  each  of  the  news  events  and  three  of  which 
report  plausible  exposures  to  messages  of  a  news  event  as 
carried  in  the  vehicles. 

To  summarize,  each  of  the  following  hypothetical 
scenarios  is  run  both  for  the  vehicle  audiences  and  for 
the  actual  news  event  audience: 

1.  One-day  average  story.  One  message  in  each  ve¬ 
hicle  during  the  day,  and  thirteen  time  periods. 

2.  Three-day  average  story.  One  message  in  each 
vehicle  for  three  consecutive  weekdays,  six  time 
periods  per  day. 

3.  Three-day  larger  story.  One  message  in  each 
radio  vehicle  and  three  messages  in  each  news¬ 
paper  for  three  consecutive  days,  six  time  per¬ 
iods  per  day. 


The  realistic  message  exposure  probabilities  and 
ratios  are  the  average  values  calculated  in  Chapter  VII. 


in  the  scenario  19  the 


Table  VIII-3  and  VIII-4  show  the  occurence  of  vehicles 
and  messages  by  time  period  and  the  realistic  format  fac¬ 
tors  and  ratios. 

Before  we  examine  the  growth  of  vehicle  and  message 
exposures  in  the  trial  scenarios,  we  might  first  ask  if 
the  simulation  is  correctly  reproducing  various  aspects 
of  the  input  data  such  as  the  average  audiences  and  cumu¬ 
lations  of  the  vehicles,  and  the  distribution  of  the  audi¬ 
ences  over  the  population  types. 

Both  of  these  questions  can  be  answered  affirmatively 
by  comparison  of  the  output  of  the  trial  scenario  with 
the  input  data.  In  the  trial  scenarios  the  Enquirer  is  de¬ 
fined  as  the  only  vehicle  in  media  type  1  (morning  news¬ 
papers)  .  Thus,  by  referring  to  the  output  tables  describ¬ 
ing  exposure  by  media  type,  we  can  observe  the  cumulative 
growth  of  the  vehicle  audience  of  the  Enquirer  (in  the 
scenarios  with  message  exposure  probabilities  equal  to 
1.0).  Table  VIII-5  shows  the  relevant  data.  The  input 
audience  for  the  Enquirer  was  about  39.20  percent  of  the 
population.  The  parameter  estimation  routine  compromised 
the  slight  inconsistencies  in  the  input  data  and  distribu¬ 
ted  an  audience  of  39.24  percent  across  the  population 
cells.  The  cumulation  routine  was  given  this  value  and 
input  data  describing  the  two-period  cumulative  audience 
of  the  Enquirer  as  45.51  percent  of  the  population.  It 
generated  2000  discrete  probabilities  which  produce  a 
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final  output  expected  audience  of  39.35  percent  and  a  final 
output  expected  two-period  cumulative  audience  of  45.52 
percent  of  the  population. 

Table  VTII-5.  Comparison  of  Input  Data  and  Simulation 
Output  Data  for  the  Enquirer. 


Output  from 

Parameter  Output  from  Trial  Run 
Statistic  Input  Estimation  Cumulation  (After 

Routine  Routine  Probability 

Assignment) 


Average 

Audience  39.20%a  39.24%  39.35%  39.35% 


Two-Period 

Cumulation  45.51%  45.51%  45.51% 


aData  are  audiences  as  percentage  of  the  population. 

From  the  Enquirer  output  data  we  can  also  judge  the 
success  of  the  simulation  in  reproducing  the  cell-by-cell 
average  audiences  produced  by  the  parameter  estimation 
process.  After  the  cumulation  routine  generates  the  2000 
discrete  probabilities,  these  are  then  assigned  to  the 
cells  of  the  population,  one  probability  for  each  member 
of  the  population.  This  assignment  is  random  except  for 
the  constraint  that  the  assignment  attempts  to  produce  an 
expected  cell  audience  equal  to  that  required  from  the 
parameter  estimation  routine.  Table  VIII-6  shows  th'i  ac¬ 
tual  output  from  a  trial  scenario  for  the  Enquirier . 


These  data  should  be  compared  with  the  results  of  the 
parameter  estimation  routine  in  Table  VIII-2,  showing  the 
same  audience  as  originally  input  and  as  produced  by  the 
parameter  estimation.  The  two  tables  of  audiences  are 
very  close,  the  largest  absolute  discrepancy  of  2.10  per¬ 
sons  occurring  for  the  cell  of  males  of  low-middle  inter¬ 
est  and  college  education.  Also,  comparing  the  marginals 
of  the  tables,  we  see  that  as  we  aggregate  or  collapse 

across  cells,  the  discrepancies  become  smaller  as  they 
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tend  to  cancel  each  other.  On  the  basis  of  this  compar¬ 
ison  we  conclude  that  the  process  of  generating  and  assign¬ 
ing  discrete  probabilities  does  faithfully  reproduce  the 
data  presented  to  it. 


The  Vehicle  Audience  Duplication 

The  final  important  segment  of  the  simulation  which 
we  have  not  yet  examined  for  validity  is  the  method  of  at¬ 
tempting  to  reproduce  empirical  duplication  figures  by  the 
assignment  of  probabilities  to  individuals  withiz.  th.*  cells. 

3 

Actually  the  dimensions  of  the  population  define  144 
population  types  or  cells,  and  the  random  assignment  pro¬ 
cess  attempts  to  reproduce  the  parameter  estimation  result¬ 
ing  audiences  for  each  of  these  cells.  Since  the  average 
size  of  each  of  these  144  cells  is  one-sixth  (144/24)  the 
size  of  the  24  cells  presented  above,  the  sample  of  proba¬ 
bilities  drawn  for  any  cell  is  only  one-sixth  as  large  as 
for  the  24  cells.  Therefore  we  would  expect  the  propor¬ 
tional  deviation  of  the  expected  audience  from  the  required 
values  to  be  greater  for  the  144  cells  than  for  the  24 
cells.  However,  this  matters  little,  since  we  will  always 
collapse  the  144  cells  to  some  much  smaller  number  for  out¬ 
put  and  analysis. 
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This  assignment  is  another  feature  which  adds  structure 
to  the  simulation:  it  defines  the  audience  duplicated 
between  two  vehicles  and  thus  the  reach  and  frequency  of 
exposure  of  various  combinations  of  media  vehicles.  Un¬ 
fortunately,  the  data  in  the  trial  scenarios  do  not  allow 
us  to  separate  out  the  effects  of  the  within  cell  non- 
random  assignment  of  probabilities  in  structuring  the 
duplication  of  the  population;  the  scenarios  would  have 
to  be  more  simple  than  those  which  we  have  run  in  order 
to  test  the  effect  of  the  nonrandom  within  cell  assign- 
sent.  However,  there  is  another  structure  which  we  sus¬ 
pect  is  much  more  important  in  determining  the  duplication 
between  any  set  of  vehicles,  namely,  the  assignment  of 
average  audiences  to  the  cells  of  the  simulation.  The 
choice  of  the  dimensions  of  the  population  was  based  on 
the  effect  of  these  dimensions  upon  the  audience  of  the 
vehicles,  i.e.  the  dimensions  were  chosen  to  define  the 
greatest  differences  between  population  types  in  their 
exposure  to  the  vehicles  of  the  mass  media  simulation. 

In  effect  these  dimensions  are  the  dimensions  which  most 
account  for  the  variability  in  exposure  of  individuals 
to  the  mass  media;  therefore  we  might  suspect  also  that 
these  dimensions  account  for  much  of  the  duplication  be¬ 
tween  any  pair  of  vehicles.  For  example,  if  there  are 
two  vehicles,  vehicle  A  and  vehicle  B,  each  of  which 
reaches  one-half  of  the  population  in  its  average  audience 
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and  if  the  distribution  of  the  audiences  of  these  vehicles 
were  completely  random  across  the  population,  then  we  would 
expect  that  on  the  average  25  percent  of  the  population 
would  be  exposed  to  both  the  vehicles  and  that  the  com¬ 
bined  audience  of  the  two  vehicles  or  the  coverage  of  the 
population  by  the  two  vehicles  would  be  75  percent  of  the 
population.  This  is  the  expected  audience  based  on  an 
absolutely  random  audience  distribution  throughout  the 
population.  Imagine,  however,  a  different  situation. 
Suppose  those  people  who  are  subscribers  or  quite  regular 
readers  to  vehicle  A  are  also  the  people  who  are  sub¬ 
scribers  and  quite  regular  readers  of  vehicle  B;  likewise 
the  infrequent  readers  of  one  vehicle  are  the  infrequent 
readers  of  another  vehicle.  In  this  case  we  expect  a 
greater  duplication  of  the  audiences  of  vehicle  A  and 
vehicle  B  than  from  the  random  model*  Some  number  greater 
than  25  percent  of  the  population  would  be  in  the  audience 
of  both  venicle  A  and  vehicle  B  and  correspondingly  the 
coverage  of  the  two  vehicles  would  be  less  than  75  per¬ 
cent  of  the  population. 

This  kind  of  situation  is  largely  accounted  for  by 
the  distribution  of  the  audiences  over  the  cells  which 
type  the  population:  since  the  population  dimensions 
serve  to  explain  much  of  the  readership  of  a  vehicle,  two 
vehicles  which  empirically  have  a  higher  'han  random  dup¬ 
lication  will  tend  to  have  large  audiences  or  high 


« 


■  f  -i? 


"'vs 


probabilities  of  exposure  in  the  same  cells  or  popu¬ 
lation  uypes.  Conversely,  two  vehicles  which  have  lower 
them  average  random  duplication  will  probably  have  their 
high  audiences  in  different  population  types.  Therefore 
we  expect  that  a  major  part  of  the  nonrandom  duplication 
found  between  any  two  vehicles  will  be  explained  (and 
produced)  simply  by  the  relative  distributions  of  the 
audiences  of  the  two  vehicles  over  the  population  types. 

We  have  tested  this  expectation  for  an  extreme  case 
with  two  of  the  vehicles  in  the  trial  scenarios,  the 
Cincinnati  evening  newspapers.  Each  of  these  newspapers 
has  a  very  high  average  audience  and  on  the  basis  of  the 
random  audience  Oistribution  model,  we  would  expect  to 
find  a  very  high  duplication  of  audience  between  the  two. 
We  can  even  argue  that  in  the  simulation  and/or  empiric¬ 
ally  we  might  even  expect  larger  than  random  duplication 
since  the  readers  of  one  evening  newspaper  might  be  much 
like  the  readers  of  another  evening  newspaper  in  their 
distribution  by  sex,  age,  interest,  socioeconomic  status, 
and  education.  Of  course  we  find,  cn  the  contrary,  that 
the  empirical  duplication  is  much  lower  than  the  duplica¬ 
tion  one  would  expect  simply  by  chance;  the  readers  of 
one  evening  newspaper  do  not  generally  read  the  other 
evening  newspaper.  Thus,  if  for  these  two  vehicles  for 
which  there  is  a  large  difference  between  the  empirical 
and  chance  duplication,  we  find  that  the  distribution  of 
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the  two  audiences  across  the  cells  accounts  for  most  of 
the  difference  between  the  empirical  and  the  random  dup¬ 
lication,  then  a  fortiori  we  can  assume  for  other  vehicles 
that,  even  before  the  assignment  process,  the  cell  audi¬ 
ences  have  probably  accounted  for  most  of  the  nonrandom 
duplication  found  between  the  two  vehicles. 

The  audiences  of  the  Times -Star  and  Post  are  50.2 
percent  and  47.6  percent  of  the  population  respectively. 

If  these  two  audiences  *  re  randomly  distributed  across 
the  population,  we  would  expect  a  chance  duplication 
based  simply  on  these  audience  sizes  equal  to  23.8  per¬ 
cent  of  the  population,  or  that  23.8  percent  of  the 
population  is  exposed  on  the  average  to  both  the  evening 
newspapers.  The  empirical  duplication,  however,  is  5.2 
percent  of  the  population;  as  might  be  expected,  the  actual 
duplication  between  the  two  evening  newspapers  is  much 
lower  than  an  estimate  based  on  a  .  iy  rand  iistribu- 
tion  of  the  audiences  across  the  population.  What  propor¬ 
tion  of  this  nonrandom  duplication  is  accounted  for  by  the 
two  audience  distributions  over  the  five  dimensions? 

This  expected  duplication,  based  on  the  individual  cell 
audiences  of  the  two  vehicles,  is  just  the  sum,  over  the 
144  cells  or  subgroups  defined  by  the  five  dimensions,  of 
the  duplication  within  each  cell.  This  expected  audience 
of  both  evening  newspapers  divided  by  the  simulation  popu¬ 
lation,  givei  * e  proportion  of  the  population  in  the 
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audience  of  both  the  vehicles.  The  trial  scenarios  do 
not  produce#  for  each  of  the  evening  newspapers,  the  cell 
audience-  actually  achieved  by  the  random  assignment  of 
the  discrete  probabilities  produced  by  the  cumulation 
routine.  We  know,  however,  that  these  probabilities  pro¬ 
duce  average  audiences  which  are  quite  close  to  the  value 
generated  by  the  parameter  estimation  routine  for  the  144 
cells;  therefore  we  have  used  these  values,  which  are 
available,  in  the  calculation  of  the  duplication  between 
the  two  vehicles  based  simply  upon  the  average  audiences 
in  the  cells.  The  result  of  the  calculation  is  a  dupli¬ 
cation  of  7.28  percent  of  the  population  for  the  two 
evening  newspapers.  Thus  the  simulation  distribution  of 
the  audiences  of  the  two  vehicles  over  the  population 
types,  based,  of  course,  on  the  input  data,  has  produced 
a  duplication  between  the  two  vehicles  which  comes  quite 
close  to  the  empirical  duplication,  5.2  percent,  the  dif¬ 
ference  being  only  2.1  percent  of  the  population.  There¬ 
fore,  we  conclude  that  even  if  the  within  cell  assignment 
of  probabilities  to  adjust  for  the  nonrandom  duplication 
not  accounted  for  by  the  cell  audiences  is  a  total  fail¬ 
ure,  the  distribution  of  the  audiences  across  the  popula¬ 
tion  types  chosen  for  the  simulation  will  most  likely 
produce  duplication  values  which  are  auite  close  to  the 
empirical  findings. 
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The  Internal  Validity  of  the  Model 


Charles  Herman,  in  a  paper  on  validation  problems  in 
games  and  simulations  defines  the  concept  of  the  internal 
validity  of  a  modili 

The  unexplained  variance  between  intended  replica¬ 
tions  would  provide  a  measure  of  reliability  or  what 
Campbell.  .  .  calls  "Internal  Validity."  When  the 
structured  simulation  properties  are  held  constant, 
the  smaller  the  between  run  variance,  the  greater 
the  internal  validity  is  assumed  to  be. 4 

This  concept  of  validity  does  not  apply  in  an  absolutely 
straightforward  way  to  the  present  simulation  since  the 
output  of  the  simulation  is  in  the  form  of  expected  values 
of  the  ranaom  variables.  In  order  to  judge  the  internal 
validity  of  the  model,  we  must  know  the  range  of  possible 
outcomes  for  different  runs  given  identical  parameter 
values.  An  estimate  of  the  variance  of  the  random  vari¬ 
ables,  in  addition  to  the  expected  values,  would  be  re¬ 
quired.  Because  of  space  and  time  limitation  we  have  ot 
programmed  the  simulation  to  produce  these  variances,  al¬ 
though  this  is  quite  easy  to  do  mathematically.  However, 
we  can  report  on  the  variability  in  the  expected  values 
due  to  the  random  probability  assignment. 

The  expected  values  of  the  random  variables  which  are 
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Charles  F.  Herman,  "Validation  Problems  in  Games  and 
Simulations  with  Special  References  to  Models  of  Inter¬ 
national  Politic.-,"  Behavioral  Science,  Volume  12,  1967, 
pp.  216-231.  ~  ””  ‘ 
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output  by  the  simulation  are  really  expected  values  con¬ 
ditioned  upon  the  random  assignment  of  probabilities  to 
the  cells  and  the  pairing  of  these  probabilities  to  ac¬ 
count  for  the  duplication.  In  theory  we  could  generate 
a  probability  distribution  for  ail  possible  assignments 
of  probabilities  given  the  assignment  process  and  combine 
that  with  the  message  expov’ire  probabilities  to  generate 
unconditional  expected  values  \t  the  output  stage  of  the 
simulation;  however,  this  theoretical  possibility  is  math¬ 
ematically  quite  complex  and  we  have  not  attempted  to  do 
this.  Therefore,  we  are  left  with  the  internal  validity 
problem:  what  is  the  range  of  the  conditional  expected 
values  of  the  output  variables  given  other  runs  of  the 
simulation  and  with  other  assignments  of  the  discrete 
probabilities?  For  the  total  popuZ  ation  such  statistics 
as  cumulation,  the  average  number  of  exposure  events,  and 
the  proportion  of  persons  exposed  at  a  given  time  period 
are  constant  for  different  random  assignments  of  the  prob¬ 
abilities,  since  the  distribution  of  the  discrete  proba¬ 
bilities  generated  by  the  cumulation  routine  is  completely 
determined.  Therefore,  any  statistics  for  a  single  ve¬ 
hicle  and  the  entire  population  at  every  time  period  must 
remain  constant  regardless  of  t£e  assignment  process. 
However,  the  same  statistics  need  not  remain  constant  for 
a  vehicle  within  a  given  cell  since  the  random  assignment 
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process  will  in  general  produce  different  discrete  proba¬ 
bilities  in  each  cell  of  the  simulation. 

A  comparison  of  the  expected  audience  of  the  Enquirer 
as  output  from  the  parameter  estimation  routine  with  that 
produced  by  the  probability  assignment  (Tables  VIII-2  and 
VIII-6)  gives  some  indication  of  the  variability  which  we 
might  expect  at  the  level  of  the  subgroup  s  which  we  typi¬ 
cally  output  from  the  simulation.  The  rardom  assignment 
process  attempts  to  produce  in  each  cell  an  expected 
audience  which  was  arrived  at  by  the  parameter  estimation 
routine.  The  amount  by  which  the  particular  assignment 
which  we  have  observed  differs  from  these  expected  values 
gives  us  a  measure  of  the  variability  which  we  might  ex¬ 
pect  from  the  random  assignment  process,  i.e.  if  we  look 
at  the  differences  between  the  values  required  by  the 
estimation  process  and  the  values  actually  achieved  by 
the  random  assignment,  unless  this  random  assignment  is 
a  very  peculiar  one  (which  would  be  quite  unusual  over 
the  twenty-four  subgroups  in  Tables  VIII-2  and  VIII-6), 
then  these  differences  indicate  the  a-uount  of  fluctuation 
we  can  expert  in  the  ce)  .  by  cell  average  audiences  due 
to  the  random  assignment  process.  The  tables  show  that 
the  amount  of  this  error  is  at  most  2*1  persons  exposed 
in  one  cell  and,  in  general,  that  the  error  is  very  small. 
Therefore  we  may  conclude  that  the  variability  produced 
by  the  random  assignment  process  does  not  affect  the 
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expected  audiences  in  the  output  subtable3  by  more  than 
a  few  percent. 

How  does  the  expected  cumulative  audience  of  a  ve¬ 
hicle  in  the  individual  cells  differ  from  random  assign¬ 
ment  to  random  assignment?  We  don't  know  the  answer  to 
this,  but  since  the  average  audience  seems  to  exhibit 
little  variability  from  random  assignment  to  random  as¬ 
signment  we  speculate  that  the  cumulative  audiences  will 
also  exhibit  small  variability  from  random  assignment  to 
random  assignment,  although  admittedly  that  variability 
may  be  magnified  as  the  number  of  time  periods  in  the 
emulation  grows  larger  and  larger. 

The  foregoing  discussion  has  been  in  terms  of  the 
possible  audiences  variations  in  various  population  types 
for  a  single  vehicle.  We  saw  that  the  average  and  cumu¬ 
lative  audiences  of  the  vehicle  over  the  entire  popula¬ 
tion  would  not  charge  given  a  different  random  assignment 
of  probabilities  Ljt  that  there  might  be  some  slight 
differences  in  the  expected  audiences  for  a  given  cell  of 
the  population  given  a  different  assignment  of  probabil¬ 
ities  and  there  may  be  some  variation,  although  hopefully 
still  a  small  variation,  in  the  cumulative  audiences  for 
a  given  cell  or  subgroup.  Also  these  variabilities  due 
to  sampling  will  be  smaller  the  larger  the  population  of 
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the  cell  or  the  smaller  the  dimensionality  of  the  sub¬ 
table  containing  the  output. 

What  can  we  say  about  the  variability  in  the  average 
and  cumulative  audiences  of  a  theme  of  messages  appearing 
in  several  different  vehicles?  Obviously  the  exposure  of 
an  audience  to  a  theme  of  messages  involving  several  ve¬ 
hicles  depends  in  part  upon  the  duplication  between  the 
vehicles,  which  is  to  some  extent  manipulated  by  the  pro¬ 
cess  of  assigning  probabilities  across  vehicles  within 
the  cells.  The  particular  probabilities  which  are  ran¬ 
domly  assigned  to  the  cells  may  affect  the  ability  of 
the  simulation  to  manipulate  the  duplication  to  conform 
to  the  empirical  duplication.  This  variability  in  dupli¬ 
cation  from  assignment  to  assignment  is  something  which 
we  have  no  measure  of;  however,  we  note  from  the  discus¬ 
sion  of  duplication  above  that  by  far  the  largest  amount 
of  duplication  across  vehicles  is  accounted  for  by  the 
average  audiences  of  these  vehicles  within  each  cell  of 
the  simulation.  These  mean  audiences  are  fixed  by  the 
parameter  estimation  process  and  do  not  fluctuate  sub¬ 
stantially  given  random  assignment  of  probabilities  as  we 
have  seen  from  our  comparison  of  Tables  VIII-2  and  VIII-6 
above.  Thus  we  conclude  that  the  largest  proportion  of 
the  duplication  will  remain  relatively  constant  from  ran¬ 
dom  assignment  to  random  assignment,  but  t  some  pro¬ 


portion  of  the  already  small  proportion  of  the  duplication 
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Which  is  manipulated  by  the  assignment  process  will  change 
from  random  assignment  to  random  assignment.  Thus,  al¬ 
though  the  exposure  of  the  audience  within  a  cell  or  over 
the  entire  population  to  a  theme  of  messages  involving 
several  vehicles  will  fluctuate  from  random  assignment  to 
random  assignment,  we  offer  a  rough  estimate  that  this 
variability  in  the  "expected"  audience  will  not  be  greater 
than  a  few  percent. 

Some  Final  Thought  About  Internal  Validity 

Although  we  have  introduced  the  notion  of  internal 
validity  and,  in  keeping  with  the  spirit  of  the  concept, 
attempted  to  make  some  assessments  of  the  between  run 
variability  in  the  simulation  (noting,  however,  that  the 
most  important  variance  was  not  calculated  in  programing 
the  model) ,  we  admit  to  some  second  thoughts  about  the 
implications  of  the  concept.  The  concern,  stimulated  by 
Hanna's  ideas  about  validating  models,  is  that  a  model 
may  have  too  little  between  run  variability. 

What  is  the  relationship  between  the  observed  data 
and  reality?  We  consider  that  the  observed  data  are  pro¬ 
duced  by  a  combination  of  the  factors  which  the  model 
attempts  to  account  for,  other  (exogenous)  factors  which 
the  model  does  not  attempt  to  account  for,  and  measure¬ 
ment  error.  If  the  real  world  could  be  "rerun"  (as  the 
simulation  can) ,  controlling  for  the  simulation  factors. 
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but  allowing  all  possible  variation  in  the  unaccounted 
for  factors  and  measurement  error,  then  we  could  produce 
a  probability  distribution  for  the  possible  observed  out¬ 
comes.  The  simulation  is  valid  insofar  as  its  probabil¬ 
ity  distribution  of  simulated  outcomes  matches  this  hypo¬ 
thetical  probability  distribution  of  real  world  outcomes. 

Referring  to  the  exogenous  factors  and  measurement 
errors  as  the  loci  of  ignorance  of  the  model,  Hanna  shows 
that  ”... the  most  informative  model  is  one  whose  predic¬ 
tions  have  exactly  the  degree  of  precision  permitted  by 
the  loci  of  ignorance  associated  with  the  referent  situ¬ 
ation.  Thus,  one  can  lose  information  due  to  over-spec- 

5 

ific  as  well  as  the  under-specific  predictions." 

The  Results  of  the  Trial  Scenarios 

We  turn  now  to  the  graphs  depicting  the  growth  of 
exposure  to  the  trial  scenarios  among  the  various  popula¬ 
tion  subgroups.  The  graphs  in  Figures  VIII-1  through 
VIII-3  show  the  growth  of  the  cumulative  percentages  of 
males  and  females  exposed  at  least  once  to  the  scenario. 
The  upper  points  represent  exposure  to  vehicles,  the  lower 
points  exposure  to  messages  in  these  vehicles.  For  each 
of  the  scenarios,  the  percentages  exposed  at  least  once 
to  the  vehicles  very  quickly  approach  100  percent.  Thus 

5 

Joseph  F.  Hanna,  Information-Theoretic  Techniques 
for  Evaluating  Simulation  Models,  Michigan  State  Univer¬ 
sity,  1969,  Mimeo,  pp.  11-12 . 
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for  the  first  trial  scenario,  the  vehicle  audience  rises 
to  greater  than  95  percent  by  the  end  of  the  thirteen th 
time  period,  which  is  the  last  radio  message  in  the  day. 
There  are  two  points  where  the  graph  rises  quite  rapidly — 
in  the  second  time  period  when  the  mroning  Enquirer  is 
available  and  in  the  seventh  time  period  when  the  Post 
and  the  Times-Star ,  the  two  evening  papers ,  become 
available.  It  is  quite  likely  that  these  increases 
should  not  be  so  abrupt,  that,  in  fact,  the  exposure  takes 
place  over  several  hours  after  the  newspapers  hit  the 
newsstands ;  however,  we  have  no  way  of  taking  account  of 
this  in  the  simulation.  At  any  rate,  by  the  end  of  the 
day  at  least  95  percent  of  both  men  and  women  are  exposed 
to  a  vehicle  at  least  once. 

The  graph  of  the  cumulative  percent  exposed  to  a 
message  is  similar  in  shape  to  that  of  the  curve  for  the 
cumulative  number  exposed  to  the  vehicle,  but  the  cumu¬ 
lative  percent  exposed  at  each  time  period  is  significant¬ 
ly  less.  Thus  at  the  end  of  13  time  periods,  for  the 
first  trial  scenario  the  cumulative  percentage  of  males 
and  females  exposed  at  least  once  to  the  message  is  about 
66  and  54  percent,  respectively.  One  effect  of  the  more 
realistic  message  exposure  probabilities  is  to  make  a 
larger  difference  between  the  proportion  of  males  and 
the  proportion  of  females  exposed  to  the  message. 

Because  the  cumulative  percentage  of  people  exposed 


i  VIII-1.  The  Cumulative  Percentage  of  Males  (x)  and  Females 
Exposed  at  Least  Once  to  the  First  Trial  Scenario 


Vehicle  Audience 


Figure  VIII-2.  The  Cumulative  Percentage  of  Males  (x)  and  Females  (.)  Exposed 

at  Least  rnce  to  the  Second  Trial  Scenario 


The  Cumulative  Percentage  of  Males  (x)  and  Females  (*)  Exposed 
Least  Once  to  the  Third  Trial  Scenario 
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grows  so  quickly  toward  100  percent,  we  have  plotted  a 
second  set  of  curves  (Figures  VIII-4  through  VIII-12) 
representing  the  growth  of  the  average  cumulative  number 
of  exposures  for  the  subgroups  defined  by  sex,  education, 
and  socio-economic  status.  This  is  a  much  more  important 
measure  of  exposure  since  it  seems  likely  that  a  certain 
minimum  number  of  exposures  is  necessary  in  order  for  a 
person  to  retain  or  be  effected  by  the  content  of  the 
average  message.  We  observe  an  interesting  effect  in 
the  graphs  by  sex.  For  the  vehicle  audiences,  women  are 
consistently  more  exposed  than  men;  however,  when  the 
message  exposure  probabilities  are  used,  the  men  are  the 
more  exposed.  Of  course,  none  of  the  differences  in  ex¬ 
posure  between  subgroups  is  very  large  (a  maximum  of 
about  1.5  exposures  by  the  eighteenth  time  period)  for 
these  short  scenarios. 

Table  VIII-7  summarizes  the  results  of  the  three 
trial  scenarios  after  the  final  time  period  for  each 
theme.  This  table  presents  data  on  the  average  of  the 
cumulative  number  of  exposure  events  for  various  sub¬ 
groups  in  the  population.  The  first  of  each  pair  of  rows 
describes  the  vehicle  audiences  and  the  second  the  message 
audiences.  Since  the  second  theme  is  equivalent  to  the 
first  theme  run  three  times,  we  will  look  particularly 
at  the  first  and  the  third  themes  for  our  analysis . 

First,  the  message  audiences  are  about  one-quarter 
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6.  The  Average  Cumulative  Number  of  Exposures  of  Males  (x) 
Females  (•)  to  the  Third  Trial  Scenario 


Figure  VIII-7.  The  Average  Cumulative  Number  of  Exposures  for  College 
(x).  High  School  (®) ,  and  Grade  School  (•)  Education  to  the  First 

Trial  Scenario 


8.  The  Average  Cumulative  Number  of  Exposures  for  College  (x) 
School  (®)>  and  Grade  School  (•)  Education  to  the  Second  Trial 


Message  Audience 
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11-11.  T’he  Average  Cumulative  Number  of  Exposures  for  Hig 
Middle  V<£) ,  and  Low  (•)  SES  to  Theme  2 


Table  VIII-7.  The  Average  Cumulative  Number  of  Exposures  to  Vehicles  or  Messages9  in 
Several  Population  Subgroups  for  Three  Trial  Scenarios 


with  more  realistic  message  exposure  probabilities  and  show 
s  carried  in  the  vehicles. 


to  one- half  of  the  vehicle  audiences  regardless  of  the 
dimension  of  the  subgroup.  For  example,  in  the  first 
theme  with  a  message  in  each  radio  broadcast  and  in  each 
newspaper,  there  is  at  the  end  of  the  day  an  average  of 
3.05  exposures  to  a  vehicle  carrying  the  message  and  1.06 
exposures  to  the  message  itself.  The  lower  figure  occurs 
in  the  third  theme  where  the  ratio  of  messages  in  the 
newspapers  to  messages  in  the  radio  increases:  since  the 
message  exposure  probabilities  for  newspapers  are  about 
one-third  of  those  for  radio,  we  expect  and  find,  that 
as  the  proportion  of  newspaper  messages  increases,  the 
average  number  of  exposures  to  the  message  becomes  a 
smaller  proportion  of  the  average  number  of  exposures  to 
the  vehicle. 

Socio-eco.  10a ic  status  appears  to  be  the  most  impor¬ 
tant  variable  fcr  the  vehicle  audience,  producing  the 
largest  range  of  exposures,  but  interest  is  the  most 
important  dimension  for  the  message  audience  given  the 
distribution  of  messages  and  exposure  probabilities  we 
have  chosen  for  these  trial  scenarios.  For  the  first 
themes  the  average  number  of  message  exposures  at  the  end 
of  the  day  ranges  from  a  low  of  .61  exposures  for  the 
lowest  interest  group  to  a  high  of  1.61  exposures  for 
the  highest  interest  group.  For  the  second  them^  the 
figures  are  just  three  times  as  large,  reflecting  the  ex¬ 
posure  at  the  end  of  three  days  of  messages.  Thus  the 


332 


vehicle  audience  at  the  end  of  three  days  has  experienced 
an  average  of  approximately  9  exposures  and  the  message 
audience  an  average  of  approximately  3  exposures.  For  a 
three-day  large  news  story  with  three  messages  in  each 
newspaper,  we  find  a  marked  increase  in  vehicle  exposure 
and  a  moderate  increase  in  the  average  number  of  expo¬ 
sures  to  the  message.  In  this  particular  case,  the  val¬ 
ues  for  the  vehicle  audience  are  not  realistic  because 
the  simulation  treats  each  appearance  of  the  message  in 
a  vehicle  as  equivalent  to  an  appearance  of  the  vehicle 
with  one  message  in  it.  Thus,  the  three  messages  in  the 
morning  Enquirer  become  equivalent  to  three  viewings  of 
the  Enquirer  and  are  added  to  the  vehicle  audience  three 
times.  However,  for  the  message  audience  the  numbers 
represent  the  average  of  the  cumulative  number  of  message 
exposures  for  each  of  the  subgroups,  just  as  above.  In 
this  case,  the  average  number  of  message  exposures  over 
the  three  days  was  4.60  and  again,  the  most  important 
dimension  is  interest  where  the  average  number  of  expo¬ 
sures  ranges  from  only  2.51  for  the  lowest  interest 
group  to  7.21  for  the  highest  interest  group. 

In  addition  to  considering  each  of  the  population 
dimensions  individually  we  present  in  Table  VIII-8  those 
subgroups  having  the  highest  and  the  lowest  frequencies 
of  exposure  for  the  combination  of  the  three  dimensions 
sex,  socio-economic  status,  and  interest.  We  find 
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average  message  audience  differences  of  five  to  six  hun¬ 
dred  percent  between  the  highest  and  the  lowest  subgroups 
For  instance,  for  the  large  three-day  story,  the  least 
exposed  subgroup  defined  by  these  three  dimensions  were 
females  of  low  socio-economic  status  and  low  interest  who 
had  an  average  of  only  1.14  exposures  over  the  three  days 
The  highest  subgroup,  on  the  other  hand,  were  the  males 
of  middle  socio-economic  status  and  high  interest  who  had 
almost  7  times  the  exposure  of  the  lowest  group,  a  figure 
of  7.78  exposures  in  the  three-day  period. 

Finally,  Table  VIII-9  shows  the  relative  importance 
of  the  media  vehicles  in  producing  message  exposures 
(but  not  numbers  of  persons  exposed  to  the  messages)  to 
the  trial  scenarios.  For  the  one-and  three-day  average 
stories,  the  evening  news  broadcasts  are  most  important 
followed  by  the  newspapers,  and  then  other  news  broad¬ 
casts  at  peak  listening  hours.  For  the  larger  story 
where  the  newspapers  each  have  three  insertions  of  the 
message,  the  newspapers  become  the  most  important  vehi¬ 
cles,  re/ersing  place  with  the  evening  news  broadcasts, 
which  are  now  second  in  order  of  importance.  The  more 
messages  carried  by  the  newspapers,  of  course,  the  more 
important  become  the  newspapers  relative  to  the  radio 
broadcasts . 

From  the  analysis  of  the  trial  scenarios  we  conclude 
that  the  simulation  probably  has  internal  validity,  that 


Table  VI1I-8.  Population  Subgroups  by  Sex,  SES,  and  Interest,  Having  the  Highest  and 
Lowest  Average  Cumulative  Number  of  Exposures  to  Three  Trial  Scenarios 
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it  does  synthesize  consistently  the  input  data,  and  that 
it  produces  plausible  expected  number  of  exposures  for 
various  population  types.  For  an  average  story,  reported 
once  in  each  of  the  vehicles  of  the  Cincinnati  mass 
media  system  during  the  day,  the  average  number  of  expo¬ 
sures  at  the  end  of  the  day  is  slightly  more  than  one 
exposure  per  person.  However,  this  average  varies  with 
a  range  from  .30  to  1.74  exposures,  depending  upon  the 
population  type.  The  most  important  vehicles  seem  to 
be  the  newspapers,  followed  by  the  evening  and  then  the 
midday  and  then  the  morning  radio  broadcasts,  in  that 
order.  We  turn  now  to  the  real  scenarios  to  see  what 
actually  happened  during  the  six  months  of  the  Cincinnati 
information  campaign. 


The  Results  of  the  Runs  with  the  Real  Scenario 


As  we  noted  in  Chapter  VII  above,  of  our  original  17 
themes  in  the  content  analysis,  we  ended  with  a  run  of  12 
themes  in  the  scenario.  Four  themes  were  eliminated  be¬ 
cause  of  the  small  number  of  messages  coded  in  the  con¬ 
tent  analysis  for  these  themes.  Unfortunately  an  impor¬ 
tant  theme  concerning  control  of  the  atomic  bomb  was  lost 
because  of  a  programming  error  in  the  simulation.  In 
the  following  pages  we  shall  examine  the  growth  and  rates 
of  exposure,  the  distribution  of  exposure  throughout  the 


337 

population  subgroups,  and  the  important  media  vehicles, 
for  the  twelve  scenario  themes. 

Cumulative  Exposure:  The  Population  Totals  After  Thirteen 
Time  Periods 

In  Table  VIII-10  we  summarize  some  of  the  relevant 
input  and  output  statistics  for  each  of  the  12  themes 
actually  run  in  the  scenario.  The  first  columns  of  the 
table  show  for  each  of  the  themes  the  number  of  messages, 
the  number  of  exposure  events  resulting  from  those  mes¬ 
sages,  the  average  number  of  exposure  events  for  the 
population,  the  number  of  exposure  events  per  message, 
the  total  number  of  persons  exposed  to  each  theme,  and 
the  number  of  persons  exposed  per  message.  These  statis¬ 
tics  are  the  values  at  the  end  of  13  time  periods  (or 
the  end  of  the  simulated  six  month  interval) ,  except  for 
tile  twelvth  theme  which  was  cut  off  after  12  time  per¬ 
iods.  The  second  column  of  numbers,  the  number  of  ex¬ 
posure  events,  is  the  expected  number  of  exposures  for 
the  total  population.  Of  course,  this  average  number  of 
exposures  is  not  necessarily  the  actual  number  of  expo¬ 
sures  of  any  particular  individual.  The  number  of  expo¬ 
sure  events  per  message  is  the  total  number  of  expected 
exposures  for  the  population  divided  by  the  number  of 
messages.  This  number  depends  upon  the  size  of  the  audi¬ 
ence  of  the  ehicle  in  which  the  message  appears  and  the 


Table  VIII-10.  The  Cumulative  Exposure  of  the  Simulation  Population  After  Six  Months 

to  the  Twelve  Scenario  Themes 
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format  of  the  message,  but  remarkably  this  figure  is  al¬ 
most  constant  for  many  themes.  Thus,  for  every  theme 
except  themes  nine  and  ten,  the  number  of  exposure  events 
per  message  lies  between  60.8  and  69.7.  This  indicates 
that  for  many  themes  the  distribution  of  the  messages  in 
the  vehicles  and  the  distribution  of  the  formating  cf 
the  messages  within  each  vehicle  was  similar. 

The  average  number  of  exposures  for  a  person  in 
the  population  varies  from  a  low  average  of  1.1  exposures 
for  the  theme  of  the  ,N.  and  human  rights,  for  which 
there  were  only  32  messages,  to  a  high  average  of  57.2 
exposures  for  the  most  common  theme  of  violence,  wars, 
and  threats  to  peace,  which  contained  1761  messages. 
(Recall,  however,  that  we  were  forced  to  reduce  the  num¬ 
ber  of  messages  in  this  theme  by  one-fifth.  Thus  the 
average  number  of  exposures  may  be  more  on  the  order  of 
71.5  exposures.)  We  note  here  that  the  greatest  average 
number  of  exposures  to  themes  five  through  twelve  is  5.1 
exposures  over  the  six  month  period.  These  themes  cure 
all  themes  relating  to  the  United  Nations  and  to  the 
questions  of  the  NORC  survey  attempting  to  measure  the 
changes  in  the  Cincinnati  population  over  the  six  month 
period.  With  this  very  limited  number  of  exposures  over 
the  six  months  and  given  that  those  already  knowing  those 
facts  or  having  those  attitudes  about  the  United  Nations 
which  the  campaign  was  attempting  to  disseminate  were 


probably  more  likely  to  have  higher  average  exposures/ 
it  is  not  surprising  that  the  campaign  found  little  suc¬ 
cess  and  little  change  in  the  Cincinnati  population. 

Not  only  do  these  figures  indicate  that  the  messages  in 
the  mass  media  failed  to  reach  those  groups  which  in¬ 
itially  were  least  aware ,  even  if  the  population  had 
been  reached  in  an  undifferentiated  way  so  that  everyone 
had  the  average  exposure/  it  seems  hardly  likely  that 
five  exposures  over  six  months  are  enough  to  cause  sig¬ 
nificant  change  in  the  attitudes  or  information  of  the 
population. 

Since  the  number  of  exposure  events  per  message  is 
relatively  constant  for  most  themes,  one  can  estimate, 
simply  by  knowledge  of  the  number  of  messages,  the  total 
number  of  exposures  for  the  population  and  thus  an  aver¬ 
age  number  of  exposures  for  each  member  of  the  popula¬ 
tion.  We  shall  see  (below)  that  there  are  certain  regu¬ 
lar  trends  of  exposure  over  the  classes  of  each  of  the 
dimensions  defining  the  population  types.  Might  it  not 
be  possible  to  distribute  these  average  number  of  expo¬ 
sures  over  the  classes  of  the  dimensions  according  to 
these  trends  and  then  calculate  expected  values  over  all 
the  population  types  to  produce  the  average  number  of 
exposure  events  for  each  population  type?  Unfortunately, 
because  of  interaction  effects,  this  is  not  totally 
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feasible,  but  might  be  worth  further  effort  in  investi¬ 
gating  this  possibility. 

The  number  of  persons  exposed  is  an  expected  number 
of  individuals  exposed  at  least  once,  and  of  course  there 
is  upward  bound  to  this  number  which  is  the  total  popula¬ 
tion  of  the  simulation.  As  more  messages  appear  and  go 
out  in  the  mass  media,  more  and  more  people  are  reached 
at  least  once.  People  reached  in  the  early  time  periods 
accumulate  exposures,  but  do  not  add  to  this  figure  of 
the  expected  number  of  persons  exposed  at  least  once. 
Thus,  we  see  that  for  those  themes  with  several  messages, 
the  expected  number  of  exposures  is  almost  the  entire 
population  and  even  for  those  themes  with  very  few  mes¬ 
sages  the  expected  number  of  exposures  is  at  least  half 
and  usually  about  three-fourths  of  the  population.  There 
is  no  way  of  telling  how  many  of  these  individuals  are 
exposed  once  or  twice  in  the  six  month  period,  and  it 
seems  quite  unlikely  that  an  exposure  of  only  once  or 
twice  in  the  six  month  period  to  most  of  the  messages 
carried  in  these  themes  will  make  any  lasting  inpression 
upon  the  individual.  A  much  more  interesting  statistic 
would  be,  of  course,  the  expected  number  of  persons  ex¬ 
posed  zero  times,  one  time,  two  times,  three  times,  and 
so  on,  i.e.  the  frequency  distribution  of  exposures  with¬ 
in  each  subgroup  of  the  population.  However,  this  dis¬ 
tribution,  although  in  principle  quite  easy  to  calculate. 
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is  impossible  t.o  manage  in  the  computer  for  which  the 
simulation  was  programmed.  The  amount  of  space  avail¬ 
able  for  storing  statistics  was  just  too  limited  to 
allow  this  kind  of  detail  for  this  and  several  other 
statistics.6 

As  more  and  more  messages  appear,  the  number  of 
times  a  given  individual  is  exposed  is  likely  to  in¬ 
crease;  therefore  the  average  number  of  persons  exposed 
per  message  decreases  with  an  increasing  number  of  mes¬ 
sages  and  a  fixed  population  size.  This  is  the  effect 
observed  in  the  final  two  columns  of  Table  VIII-10. 

Cumulative  Exposure :  The  Distribution  of  Exposure  over 
the  Population  Subgroups 

The  data  above  compare  total  exposure  from  theme  to 
theme  as  a  function  of  the  number  of  messages;  in  addi¬ 
tion  we  can  identify  for  each  theme  those  population 
subgroups  most  exposed  and  those  least  exposed  to  each 
theme.  The  average  cumulative  number  of  exposures  for 
each  level  of  each  population  dimensions  and  the  overall 
averages  are  tabulated  in  Table  VIII-11,  at  the  thirteenth 
time  period.  Unfortunately,  the  data  for  the  first  four 
themes  and  especially  for  the  third  theme  have  signifi¬ 
cant  errors  in  their  absolute  magnitudes. 

®The  simulation  is  presently  being  reprogrammed  for 
the  IBM  360-65,  a  much  larger  machine  than  the  present 
one,  an  IBM  7094.  On  the  new  machine,  the  storage  prob¬ 
lems  will  be  much  less  critical  and  these  statistics  will 
be  available. 
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Because  of  a  limitation  of  space  in  the  computer 
and  the  programming  necessary  to  take  account  of  this, 
the  data  necessary  to  generate  these  statistics  is  ac¬ 
cumulated  in  the  computer  as  the  expected  number  of  ex¬ 
posures  for  each  individual  in  the  simulation  population 
via  each  of  the  media  types  for  up  to  six  media  types. 
For  a  simulation  population  of  2000  and  six  media  types, 
12,000  numbers  are  kept  cumulatively  up-to-date  in  order 
to  generate  these  average  number  of  exposures  presented 
in  Tables  VII1-11  and  VIII-12.  in  order  to  keep  such 
large  numbers  of  statistics  in  the  computer  the  simula¬ 
tion  was  programmed  to  pack  these  statistics  four  to  a 
word  in  a  computer  word.  Exposures  are  then  measured 
in  fiftieths  allowing  as  many  as  511  fiftieths  or  10,2 

7 

exposures  for  each  individual  via  each  media  type. 

For  those  themes  which  have  a  large  number  of  messages, 
the  number  of  exposures  for  some  individuals  via  some 
media  types  exceeded  10.2  exposures.  At  this  point 
the  computer  stops  accumulating  exposures  for  this  per¬ 
son  via  this  media  type  and  the  record  indicates  only 
10.2  exposures,  thus  undercounting  exposures  for  this 
person.  At  the  time  of  programming  the  computer  this 
seemed  a  logical  procedure  especially  if  the  output  is 

7 

On  the  new  machine  (see  Footnote  6,  above)  this 
upper  limit  will  be  much  higher  and  adjustable  by  the 
researcher. 


I 


344 

in  terms  of  a  distribution  of  exposures  across  individ¬ 
uals.  Thus,  if  we  know  the  expected  number  of  individ¬ 
uals  exposed  zero,  one,  two,  three,  four,  five,  etc., 
times,  then  having  an  upper  limit  of  ten  or  more  expo¬ 
sures  is  quite  suitable  for  data  output  in  this  form; 
after  ten  exposures,  whatever  effect  might  happen  has 
probably  happened.  However,  Tor  calculating  the  average 
number  of  exposures  for  subgroups  of  the  population, 
this  procedure  gives  inaccurate  averages,  the  larger 
the  average  number  of  exposures  for  the  entire  popula¬ 
tion,  the  more  inaccurate  cure  the  averages  generated  by 
the  process. 

Each  individual  can  have  recorded  a  maximum  of 
10.2  exposures  for  each  media  type,  resulting  in  a 
theoretical  upper  maximum  of  61.2  exposures  which  could 
be  recorded  for  an  individual.  Table  VIII-10  shows  a 
maximum  average  of  57.2  exposures  for  the  population 
for  the  theme  three.  Only  if  the  exposures  were 
evenly  distributed  across  the  entire  population  and 
across  each  of  the  media  types  would  the  simulation 
accurately  produce  the  average  number  of  exposures 
for  subgroups  of  the  population.  Fortunately,  there 
are  other  cumulative  totals  (for  the  individual  media 
vehicles  and  each  of  the  six  media  types)  for  the  num¬ 
ber  of  exposures,  and  these  statistics  are  accurate. 

From  these  we  get  an  estimate  of  the  amount  of 


Table  VIII-11,  The  Average  Cumulative  Number  of  Exposures  for  Several  Simulation 

Population  Subgroups  after  Six  Months 
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Because  of  a  programming  error,  the  twelfth  theme  was  terminated  after  twelve  rather 
than  thirteen  time  periods  or  68  rather  than  75  messages. 


Table  VIII-11.  (Continued) 


M 


! 


346 


S 


>i  © 

A 

00  O' 

O'  00 

o 

O 

1"  o 

i" 

rH 

o 

CM 

rH  A 

O' 

TO  VO 

M"  CM 

CO 

TO 

O 

TO 

CM 

CO 

TO 

rH  P 

•H 

•  • 

•  • 

• 

« 

•  • 

• 

• 

• 

• 

© 

S 

f"  o 

CM  CM 

CM 

CO 

T?  rH 

ID 

CM 

CM 

CM 

-h  *a 

rH  CM 

TO  rH 

O  G 

« 

©  © 

O  O' 

to 

rH 

r-' 

CT\  rH 

TT 

O 

in 

rH 

o. 

^  rH 

TO  *»■ 

00  00 

O' 

o 

ID  rH 

rH 

r-~ 

CA 

CM 

©  © 

©  rd 

•  • 

•  • 

• 

• 

•  • 

m 

• 

• 

©  © 

p 

0,-0 

TO  00 

O'  o 

rH 

TO 

TO  rH 

in 

rH 

rH 

CM 

O' 

a 

Oi-H 

H  H 

CM  rH 

© 

© 

o  a 

©  m 

p 

u  © 

© 

i  ® 

TO  rH 

O'  TO 

CM 

TT 

rH  ID 

■V? 

in 

rH 

3  © 

P 

M  H 

00  00 

O'  CO 

00 

a\ 

o 

O 

ID 

00 

rH 

©  E 

G 

® 

•  • 

•  • 

• 

• 

•  • 

• 

• 

• 

• 

O 

M 

s 

r-' 

00  o 

rH 

CM 

CO  *H 

in 

rH 

rH 

CM 

DiP  « 

O  -H 

rH  r- 1 

CM  iH 

X  O  rH 

a 

©  -H 

P  © 

TT  VO 

r- 

00 

TO 

00  TO 

in 

CM 

o 

00 

©  ©  P 

00  CM 

TO  O' 

■vr 

ID 

o\  00 

00 

CM 

TO 

P  A  © 

©  s  >o 

O 

•  • 

•  • 

• 

• 

•  • 

• 

• 

• 

• 

CM  TO 

in  oo 

rH 

CM 

CM  O 

co 

rH 

rH 

rH 

6  3 

H  rH 

CM 

-H  G  © 
p  p 

m  ©  O 

rH  TO 

00  rH 

o 

TO 

ID  TO 

t'- 

•vr 

r- 

©  O*  E 

00  00 

00  VO 

o 

rH  in 

00 

00 

o 

rH 

p  P 

O 

•  • 

•  • 

• 

• 

•  • 

• 

• 

• 

• 

©  ©  P 

PI 

O'  rH 

rH  ID 

rH 

rH 

CM  O 

CM 

o 

•H 

rH 

©H  0 

H 

CM 

G  UH 

3  © 

® 

co  co 

00  CM 

00 

O' 

rH  TO 

TO 

O' 

o\ 

ID 

A  P 

H 

O'  O' 

CM  CM 

O' 

rH 

I"  rH 

TO 

a\ 

CM 

><P  X 

TO 

•o 

•  • 

•  • 

• 

• 

•  • 

• 

• 

• 

• 

rH  © 

K 

•o 

TO  00 

O  rH 

rH 

TO 

TO  rH 

in 

rH 

rH 

CM 

A  P  P 

TO 

-h 

rH  rH 

TO  rH 

©  O 

a 

TO  CM 

O'  CM 

rH 

O 

rH  •'T 

rH 

rH 

ID 

A  © 

O  ©  A 
P  ©  P 

•G 

rH 

rH  TO 

00 

CM  "VT 

in 

CO 

TT 

00 

04  3 

O' 

•  • 

•  • 

• 

• 

•  • 

• 

• 

• 

• 

©  © 

-H 

00  tH 

TO  CM 

CM 

CO 

T  rH 

ID 

CM 

CM 

CM 

©  O  © 

S 

rH  CM 

TO  rH 

P  ©  TO 
©XJ 

■o 

G 

0 

©  ©  P 

-H 

«  « 

©  © 

•CP© 

rH  P 

rH  00 

TO  CM 

00 

rH  O 

rH 

O 

O 

TO 

P  3  P 

«  « 

CM  O 

CM  TO 

00 

o 

ID  rH 

rH 

I"'- 

O' 

rH 

©  3 

P  .H 

•  • 

•  • 

• 

• 

•  • 

• 

• 

• 

• 

«H  o  cu 

O  3 

TO  00 

O'  o 

rH 

TO 

TO  rH 

ID 

rH 

rH 

CM 

rH  Qi  £ 

eh  o. 

rH  rH 

CM  rH 

©  X  O 

0 

©  O 

Pi 

©  P  © 

A 

©©£ 

G 

P 

E  ©  P 

© 

0 

-H 

©  x: 

© 

P 

P 

H 

s 

•G  0"*H 

A 

© 

A 

p 

A 

P  -H  o 

O'  G  P 

©  2  P 

©  rH  © 

O' 

© 

P 

XI 

A 

G  0-r| 

‘OD 

A  -H  p 

P 

H 

C 

G 

•rl 

G 

P  G 

•h  Eh 

CO  © 

P 

P 

O  U  G 

« 

© 

© 

s 

0 

3  *w  O 

o,  G  -h 

P  ©  c 

3 

G 

© 

rH 

rH 

•W 

0  O  -rl 

©  ©  p 

©  04  -H 

0 

G 

3  ©  6 

c 

04 

Qi 

G 

P 

4H  P 

® 

i  ©x  as 

A 

•H 

0  © 

© 

© 

X 

0 

U 

Q  © 

e 

*  0 

0  G 

« 

P 

U-H 

© 

g 

•H 

w 

-rl 

© 

p  ©  p 

® 

0>S 

*•  P  0 

© 

i  »H 

P 

>i  3 

p 

a  p 

<4-1 

©  Q  -H 

•C 

®  G 

©  -rl 

O'  S  rH 

>,  o,  o>  p  s 

« 

©  P  D 

O 

© 

P  O  E 

O  -H  « 

o  m  ffi 

G 

0  C  P  E  < 

© 

G 

P  -rl 

© 

-H 

*H  (0  H 

©POT 

CPflHft 

0  -rl  H 

©  >0 

G 

o  u 

©  «p  a  p  a 

PtHH 

©  0  TO 

©©©X 

p 

P 

©  G-rl 

©-H  A 

©  D 

«  D 

u 

A  EDH  ©  ap 

0  0 

3  0© 

©  © 

O 

CHP-H 

© 

©  © 

0  1 

OP© 

>iP  P 

ops 

o 

G 

O  0, 

P 

© 

© 

© 

A  &  CP 

a  P  TO  -H  fi  -rl 

G 

© 

© 

©  © 

o 

©  a-rl 

ax«w 

©•C-H.fi 

AA  © 

I  DP(D>BQ<>>TO>ft(ZDUWE 

0  TO  P  Q  P 

P  P 

I 

P  0 

i  •  • 

•  • 

• 

• 

• 

• 

• 

• 

• 

• 

0  G  P 

«H  cm  co  in 

TO 

r»- 

00  O' 

o 

rH 

CM 

04  *H  © 

rH 

rH 

rH 

© 

Because  of  a  programming  error,  the  twelfth  theme  was  terminated  after  twelve  rather 
than  thirteen  time  periods  or  68  rather  than  75  messages. 
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undercounting  which  has  taken  place  for  each  of  the 
themes  due  to  the  large  number  of  messages  in  the  themes. 
The  comparison  shows  that  for  theme  one,  by  the  thir¬ 
teenth  time  period  20  percent  of  the  exposure  events 
were  not  counted.  For  theme  two  the  figure  is  21  per¬ 
cent,  for  theme  three  49  percent,  and  for  theme  four 
the  figure  is  9.8  percent.  The  only  other  significant 
undercounting  is  in  theme  nine,  about  5.5  percent  of 
the  exposure  events.  Of  course,  this  undercounting  is 
not  random.  It  will  probably  be  largest  in  those  cells 
which  have  the  highest  probabilities  of  exposure.  In 
the  data  of  Tables  VIII-11  and  VIII-12  the  highest  aver¬ 
age  number  of  exposures  for  the  first  four  themes  should 
be  even  higher.  In  the  graphs  (below)  plotting  the 
growth  of  exposure  for  each  of  the  themes,  for  the  first 
four  themes  we  have  extrapolated  from  the  initial  time 
periods  when  the  undercounting  is  minimal  to  the  correct 
total  number  of  exposures  for  the  theme  in  the  later 
time  periods. 

Noting  these  problems  with  the  data  for  the  first 
four  themes  and  especially  for  the  third  theme,  we  turn 
to  the  average  number  of  exposures  for  the  classes  of 
each  of  the  population  dimensions  in  Table  VIII-11. 

These  average  exposures  exhibit  a  wide  variation  from 
theme  to  theme  and,  for  a  given  theme,  within  socio¬ 
economic  classes  and  educational  levels.  In  every  case. 


however,  we  find  males  are  more  exposed  than  females, 
people  with  higher  education  are  more  exposed  than  peo¬ 
ple  with  lower  education,  those  of  higher  SES  are  more 
exposed  them  those  of  lower  SES,  those  with  higher  in¬ 
terest  cure  more  exposed  than  people  of  lower  levels  of 
interest.  Only  for  the  dimension  age  does  the  most  ex¬ 
posed  subgroup  vary  from  theme  to  theme,  but  the  dif¬ 
ferences  in  exposure  are  so  small  as  to  be  negligible. 

We  also  note  that  the  ratio  of  the  highest  average  number 
of  exposures  to  the  lowest  for  each  of  the  dimensions 
is  relatively  constant  from  theme  to  theme. 

Cumulative  Exposure:  The  Most  and  Least  Exposed 
Subgroups  by  Sex,  by  Education,  by  SES 

Table  VIII-12  shows  the  most  exposed  and  the  least 
exposed  subgroups  by  sex,  by  education,  and  by  SES  for 
each  of  the  themes,  and  the  average  number  of  exposures 
for  each  of  these  subgroups.  There  are  several  inter¬ 
esting  features  to  these  data.  First,  we  might  expect 
that  the  most  highly  exposed  group  would  be  college 
males  of  high  SES;  we  find  instead  that  it  is  the  high 

O 

school  or  grade  school  educated  males  of  high  SES. 

These  findings  prompted  us  to  look  at  the  audience 

g 

The  exceptions  to  this,  the  groups  of  college  edu¬ 
cated  males  and  females,  of  low  SES,  must  be  discounted 
since  these  groups  have  only  two  and  one  members,  re¬ 
spectively,  and  therefore  cure  not  large  enough  to  pro¬ 
vide  any  meaningful  estimate  of  exposure. 


» 


i 

I 
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Table  VIII-13.  The  Percentages  of  the  Sex  by  SES  by  Edu¬ 
cation  Subgroups  in  the  Average  Audiences 
of  Each  of  the  Three  Weekday  Newspapers 


Male 

Female 

SES  College 

High 

School 

Grade 

School 

College 

High 

School 

Grade 

School 

Enquirer 

High 

74 . l%a 

22.0% 

77.4% 

71.7% 

55.5% 

46.3% 

Middle 

70.2 

55.3 

45.4 

44.4 

36.2 

30.3 

Low 

*b 

28.3 

11.7 

* 

34.4 

20.1 

Post 

High 

72.2% 

74.9% 

64.5% 

71.7% 

59.7% 

64.8% 

Middle 

62.4 

52.4 

71.2 

69.4 

49.5 

48.0 

Low 

* 

46.0 

36.9 

* 

57.3 

38.6 

Time- Star 

High 

53.2% 

39.6% 

77.4% 

46.7% 

52.6% 

41.7% 

Middle 

54.6 

64.9 

49.5 

30.6 

59.7 

47.3 

Low 

* 

51.3 

56.3 

* 

40.1 

44.8 

aThe  data 

are  the  percent 

of  each  subgroup 

in  the  . 

audience 

Thus  74.1  percent  of  the  college-educated  males  of  high 
SES  were  in  the  audience  of  the  Enquirer . 

bThe  asterisks  indicate  that  the  number  in  the  population 
subgroup  in  the  NORC  sample  was  too  small  to  estimate  the 
audiences . 
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distributions  of  those  vehicles  for  which  the  input 
data  specify  this  distribution  by  sex,  education,  and 
SES.  Since  the  radio  audiences  were  specified  only  by 
sex  (and  were  otherwise  distributed  randomly  by  the 
parameter  estimation) ,  the  vehicles  of  interest  are  the 
newspapers,  especially  the  weekday  newspapers. 

The  individual  audiences  of  these  three  newspapers 
as  percentages  of  each  subgroup  are  shown  in  Tabl^ 
VIII-13.  Of  the  high  SES  males,  those  with  grade  school 
education  seem  to  have  the  most  coverage  (an  average  of 
73.1  percent),  followed  by  the"  college  educated  (a  66.5 
percent  average) ,  and  finally  the  high  school  educated 
(a  45.5  percent  ave- age) .  This  may  explain  in  part  the 
high  exposure  of  the  grade  school  educated,  but,  of 
course,  the  actual  exposure  depends  also  upon  the  ve¬ 
hicle  distribution  of  the  messages,  their  formating, 
and  the  ratios  of  message  exposure  probabilities. 

A  second  interesting  feature  of  the  table  is  that 
the  same  subgroups  tend  to  be  highly  and  little  exposed 
from  theme  to  theme*  Thus,  although  the  magnitude  of 
exposure  varies,  the  relative  flow  of  messages  seems  not 
to  vary.  Since  we  have  not  been  able  to  establish  dif¬ 
ferent  ratios  of  message  exposure  probabilities  from 
theme  to  theme,  the  only  possible  way  for  the  distribu¬ 
tion  of  exposure s  to  vary  from  theme  to  theme  is  if  the 
distribution  of  messages  i;i  the  vehicles  (and  thus  their 
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audiences)  tends  to  vary.  Evidently  they  don't.  We 
shall  look  more  closely  at  this  phenomenon  when  we  dis¬ 
cuss  the  correlations  of  exposures  with  changes  in  the 
panel. 

Finally  we  note  the  wide  range  in  exposure  between 
the  most  and  least  exposed  subgroups.  Excepting  the 
first  four  themes  where  the  data  must  underestimate  the 
highest  exposures  by  about  a  factor  of  two,  the  most 
exposed  groups  have  nine  to  fifteen  times  the  exposure 
of  the  least  exposed  group.  The  dimensions  sex,  educa¬ 
tion,  and  SES  do  seem  to  explain  much  of  the  variation 
in  exposure. 

Cumulative  Exposure;  The  Most  Important  Media  Vehicles 

Tables  VIII-14  and  VIII-15  show  the  distribution 
of  the  number  of  exposure  events  for  the  media  types 
and  the  most  important  media  vehicles.  The  exposures 
by  media  type  indicate  that  the  combination  c£  the  after¬ 
noon  papers  was  responsible  for  most  of  the  exposures  to 
the  themes  in  the  scenario  and  the  newspapers  combined 
accounted  for  35  to  70  percent  of  the  exposures  of  the 
population  to  the  themes . 

Figures  in  Table  VIII-15  represent  the  proportion 
of  the  total  exposures  to  each  theme  which  are  accounted 
for  by  each  of  the  six  most  important  media  vehicles. 

For  the  first  four  themes  involving  the  important  news 
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stories  of  the  period — matters  of  peace  and  war,  great 
power  conflicts,  dissension  in  the  United  Nations,  etc., — 
the  vehicle  carrying  the  largest  proportion  of  exposures 
was  always  the  newspaper,  the  Post.  For  both  themes  one 
and  four  the  Post  is  the  most  important  by  far  of  the 
media  vehicles;  in  the  other  two  it  is  followed  closely 
by  the  Time-Star.  The  average  audiences  of  the  three 
newspapers  (for  the  Time-Star  50.2  percent  of  the  popu¬ 
lation,  for  the  Post  47.6  percent,  and  for  the  Enquirer 
39.2  percent)  indicate  that  the  increased  number  of  ex¬ 
posures  of  the  Post  over  the  Time -Star  cannot  be  accounted 
for  simply  by  the  differences  in  newspaper  audiences  but 
by  a  combination  of  number  of  stories  carried  and  the 
formating  of  these  stories  in  the  newspaper.  For  these 
important  news  stories  the  Time-Star  was  the  second  most 

9 

important  vehicle,  often  followed  by  the  Enquirer.  Us¬ 
ually  the  weekday  evening  news  broadcasts  at  six,  seven, 
and  eight  o'clock  were  additional  important  vehicles  in 
exposing  the  population  of  these  news  stories. 

For  several  of  the  themes  with  low  message  frequen¬ 
cies  and  more  specialized  kinds  of  messages,  e.g. 

9 

In  conversations  with  the  author,  several  members 
of  the  publicity  committee  for  the  information  campaign 
remarked  that,  among  the  three  newspapers,  they  had  the 
most  success  in  placing  news  items  in  the  Post.  Also, 
at  that  time  the  Enquirer  was  probably  the  most  liberal, 
the  Time-Star  the  least  liberal,  of  the  newspapers. 


explanation  of  the  United  Nations#  messages  about  the 
Cincinnati  Plan  sponsors#  or  the  United  Nations  and  its 
relationship  to  human  rights  (themes  eight#  nine  and 
ten) #  the  Sunday  newspaper  was  relatively  more  important 
reflecting  the  special  character  of  these  news  features . 
Also#  for  the  ninth  theme#  the  Saturday  evening  news 
broadcasts  were  important. 

In  one  sense,  however#  these  comparisons  of  media 
vehicles  were  misleading  in  that  a  weekday  news  vehicle 
accumulates  average  exposures  over  six  days  of  each 
week  while  the  weekend  news  vehicles  appear  only  once 
in  each  week.  Perhaps  a  more  legitimate  comparison 
would  be  made  by  comparing  one -sixth  of  the  weekday  ex¬ 
posures  and  one-fifth  of  the  weekday  news  broadcasts  to 
the  exposures  of  the  weekend  vehicles. 

Cumulative  Exposure:  The  Duplication  of  Exposure  Across 
Themes 

The  final  table#  Table  VIII-16  shows  the  expected 
audience  duplication  between  each  of  the  first  three 
themes  and  the  other  themes  by  sex,  by  education,  and 
by  socio-economic  status.  As  originally  conceived  this 
kind  of  statistic  would  allow  one  to  explore  the  clus¬ 
tering  of  exposures  to  the  various  themes  and  therefore 
perhaps  explain  some  of  the  interrelatedness  of  the  at¬ 
titudes  or  information  within  the  population.  It  will 
be  recalled  that  because  of  the  limitations  of  space  in 
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the  computer  the  duplication  statistics  were  a  compro¬ 
mise;  we  might  like  to  know  the  audience  common  to  any 
possible  combination  of  themes  in  the  simulation,  by 
any  of  the  dimensions  of  the  populations,  for  any  time 
period.  Unfortunately ,  there  is  not  enough  capacity 
in  the  present  machine  zo  maintain  all  of  these  statis¬ 
tics.  In  fact,  there  is  not  enough  capacity  in  the 
machine  to  calculate  duplications  between  every  pair  of 
themes  and  so  we  have  settled  upon  the  compromise  of 
allowing  duplication  statistics  between  any  theme  and 
each  of  the  first  three  themes  run  in  the  simulation. 
Therefore  one  must  decide  in  advance  which  particular 
duplications  are  of  greatest  interest  and  make  those 
themes  which  are  of  central  importance  the  first  three 
themes  of  the  simulation. 


In  addition,  there  is  another  problem  with  the 
statistics  generated  by  the  duplication  calculations. 
The  duplication  is  calculated  in  the  following  manner: 
for  each  individual  the  cumulative  probability  of  not 
having  been  exposed  at  all  to  each  theme  is  maintained 
(or  some  variant  of  this  that  allows  the  calculation 
of  this  probability)  for  each  individual.  From  these 
non-exposure  probabilities,  we  calculate  for  each  theme 
the  probability  of  at  least  one  exposure,  the  product 
of  these  probabilities  for  any  pair  of  themes  is  the 
probability  of  exposure  to  both  of  the  themes.  The  sum 


Table  VIII-16.  The  Expected  Audience  Duplication  Between  the  First  Three  Themes  and 

Each  of  the  Other  Themes  by  Sex,  Education,  and  SES 
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of  these  probabilities  of  exposure  for  a  population 
subgroup  or  cell  is  the  expected  number  of  people  in  the 
subgroup  exposed  at  least  once  to  each  of  the  pair  of 
themes.  Of  course,  this  expected  number  includes  peo¬ 
ple  who  are  exposed  only  once  to  each  theme  or  once  to 
one  theme  and  twice  to  another  theme,  etc.  As  we  have 
argued  above,  it  would  be  preferable  to  have  the  statis¬ 
tic  which  represented  the  expected  number  of  people  ex¬ 
posed  to  each  of  the  themes  at  least  some  minimal  number 
of  exposures .  If  five,  or  six,  or  seven  exposures  to  a 
theme  are  necessary  for  any  significant  change  over  six 
months  for  most  of  the  kinds  of  messages  and  themes  in 
the  present  simulation,  then  we  would  prefer  to  know 
the  expected  number  of  people  exposed  at  least  five 
times  to  each  of  the  two  themes. 

Since  the  statistic  calculated  in  the  present  simu¬ 
lation  is  the  proportion  of  each  subgroup  exposed  at 
least  once  to  one  of  any  of  the  first  three  themes  and 
any  one  of  the  other  themes,  and  since  the  percentage 
of  any  subgroup  in  any  of  the  first  three  themes  who 
are  exposed  at  least  once  is  extremely  high,  approxi¬ 
mating  one  hundred  percent  (Table  VIII-10  shows  1984.5 
of  a  total  of  2000  persons  exposed  at  least  once  to  the 
first  theme) ,  then  the  statistics  presented  in  this 
table  represent  almost  exclusively  the  proportion  in 
the  subgroups  exposed  at  least  once  to  themes  other 
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than  themes  one,  two,  or  three.  In  fact  these  statis¬ 
tics  are  almost  identical  to  the  percentage  of  each 
audience  type  exposed  for  each  of  the  other  themes. 

This  makes  the  table  of  duplications  rather  uninterest¬ 
ing  since  it  simply  reflects  the  exposure  to  themes 
other  than  the  first  three  themes.  Had  we  anticipated 
this  result,  we  could  have  run  other  themes  as  the  first 
three  themes,  and  gotten  at  least  some  duplication 
statistics  between  the  less  frequently  occuring  themes. 

There  are  few  surprises  in  these  data.  Exposure 
is  distributed  across  the  dimensions  of  the  population 
in  just  about  the  way  which  one  might  expect,  except 
that  there  were  some  interactions  when  we  put  these 
dimensions  together,  probably  due  to  the  proportionally 
larger  newspaper  vehicle  audiences  in  th-  high  school 
and  grade  school  educated  males.  The  weekday  news¬ 
papers  and  the  morning,  mid-day  and  evening  news  broad¬ 
casts  are  the  most  important  vehicles,  in  terms  of  num¬ 
ber  of  exposures.  The  very  useful  fact  about  these  data 
is  that  they  provide  actual  number  of  exposures  for  each 
population  subgroup  to  the  themes  in  the  media  system 
and  trace  the  growth  of  these  exposures  over  time  per¬ 
iods.  These  numerical  data  can  be  incorporated  into  a 
model  of  some  effects  of  exposure  to  the  media  system. 
Following  the  graphs  of  the  growth  of  cumulative  expo¬ 
sure,  we  will  propose  two  such  models  and  examine  the 
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resulting  correlations  between  exposure  to  the  various 
themes  and  changes  in  information,  attitudes  and  opin¬ 
ions  in  the  population  over  the  six  months.*0  First, 
however,  we  show  the  growth  of  exposure  for  several 
dimensions  over  the  thirteen  two-week  time  periods  for 
the  twelve  themes  of  the  scenario. 


The  Growth  of  Cumulative  Exposure 

The  first  three  graphs  displayed  below  show,  for 
the  first  theme,  the  growth  of  the  percentage  exposed 
at  least  once,  by  sex,  education,  and  SES.  As  we  noted 
above  in  the  discussion  of  Table  VIII-10,  nearly  every¬ 
one  is  exposed  at  least  once  by  the  thirteenth  time 
period  for  all  of  the  themes.  In  the  early  time  periods, 
there  are  small  differences  between  subgroups  in  the 
cumulative  percent  exposed,  but  these  differences  dis¬ 
appear  quite  quickly.  Since  these  statistics  are  not 
as  interesting  as  the  cumulative  number  of  exposures 
and  since  most  of  the  curves  for  most  of  the  themes 
grow  to  greater  than  ninety  percent  exposed,  we  turn 


°We  note  that  the  simulation  is  not  presently 
programmed  in  the  best  way  to  easily  attach  a  model  of 
change  or  conversion  in  the  population  or  the  spreading 
of  information  by  word  of  mouth  resulting  from  the  ex¬ 
posures  to  the  messages.  Because  of  the  limited  cap¬ 
acity  of  the  machine,  we  can  not  store  and  maintain  the 
distribution  of  numbers  of  people  with  different  num¬ 
bers  of  exposures  for  each  population  subgroup.  How¬ 
ever,  in  the  reprogrammed  version,  these  data  will  be 
stored  and  available  at  a  later  time. 


Least 


Figure  VIII-14.  The  Cumulative  Percentage  of  College  (  +  ),  High  School  (  • 
and  Grade  School  ( . )  Educated  People  Exposed  at  Least  Once  to  the 

First  Theme 


Fig.ure  VIII-15.  The  Cumulat 
Low  (#)  SES  People  F 


now  to  the  graphs  of  the  growth  of  the  cumulative  num¬ 
ber  of  exposures. 

The  Cumulative  Exposures  of  Several  Population  Subgroups 
to  Each  of  the  Simulation  Themes 

We  have  noted  above  that  the  record  of  exposure  by 
population  type  for  each  of  the  first  four  themes  is 
inaccurate  and  underestimates  the  true  number  of  expo¬ 
sures  because  of  the  computer  storage  capacity  problem 
and  the  high  number  of  messages  for  each  of  these 
themes.  However,  from  the  record  of  exposures  either 
by  media  or  media  types,  we  have  accurate  values  for 
the  expected  number  of  exposures  for  the  entire  popula¬ 
tion,  and  therefore  for  the  average  number  of  exposures 
for  the  population  as  a  whole.  If  we  look  at  exposures 
for  any  exhaustive  set  of  subgroups,  e.g.,  males  and 
females  or  males  and  females  of  college,  high  school, 
and  grade  school  education,  then  the  true  average  expo¬ 
sures  of  these  subgroups  of  the  simulation  population 
must  distribute  themselves  about  the  population  average 
value . 

In  the  graphs  below  showing  the  cumulative  number 
of  exposures  for  various  subgroups  of  the  population, 
we  have  for  the  first  four  themes  attempted  to  correct 
for  the  under counting.  Let  us  take  as  an  example  the 
case  of  the  third  theme  (the  worst  case)  in  which  at 
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the  end  of  thirteen  time  periods  the  number  of  exposures 
counted  in  the  subgroups  was  only  fifty  percent  of  the 
actual  total  number  of  exposures.  In  the  first  time 
periods ,  before  the  number  of  messages  grows  large,  the 
error  in  cumulating  exposures  is  small;  however,  by  the 
fifth  time  period  the  error  equals  8.3  percent,  i.e., 
the  exposures  in  the  subgroups  underestimate  the  true 
exposures  by  about  8.3  percent.  Recalling  the  discus¬ 
sion  of  the  source  of  this  error  above  (p.  343)  iJ~ 
seems  likely  that  in  any  set  of  population  subgroups, 
the  smaller  the  average  number  of  exposures  actually 
counted,  the  less  the  percent  of  error  in  that  count. 
Thus,  for  the  graph  and  growth  of  exposure  by  sex  for 
the  third  theme,  since  the  females  are  the  least  expo¬ 
sed  of  the  two  groups,  it  is  probable  that  the  degree 
of  underestimation  of  their  exposure  is  less  than  that 
of  the  males.  Since  the  amount  of  underexposure  as  a 
whole  at  the  fifth  time  period  is  8.3  percent,  we  might 
estimate  that  for  the  females  the  amount  of  underexpo¬ 
sure  is  on  the  order  of  5  percent  compared  with  perhaps 
10  or  11  percent  for  the  males.  Taking  the  value  of  5 
percent  for  the  females  we  calculate  a  revised  estimate 
of  the  number  of  exposures.  From  the  total  number  of 
exposures  for  the  population  at  the  fifth  time  period 
and  the  number  of  exposures  for  the  females ,  the  dif¬ 
ference  is  the  number  of  exposures  for  the  males ,  from 
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which  the  average  is  easily  calculated.  We  have  pro¬ 
ceeded  in  this  fashion  for  each  time  period,  producing 
more  realistic  values  of  exposures  for  the  various  sub¬ 
groups  of  the  population. 

For  each  theme  we  have  plotted  the  growth  of  ex¬ 
posure  by  sex,  education,  and  socio-economic  status. 

The  six  month  interval  is  divided  into  thirteen  two- 
week  time  periods,  beginning  approximately  September 
15,  1947  and  ending  March  15,  1948.  Thus,  the  end  of 
the  first  simulated  time  period  occurs  on  September 
29,  1947. 

During  the  six  month  period  the  most  important 
international  news  events  (the  events  most  in  the  news¬ 


paper  headlines)  concerned  the  near  war  between  Jews 
and  Arabs  in  Palestine  and  partitioning  of  the  area. 
Thera  were  constant  bombings,  rioting,  and  violent  en¬ 
counters  between  the  Jews,  Arabs,  and  the  British  who 
nominally  were  in  control.  On  November  11,  1947,  the 
United  States  and  Russia  came  to  an  agreement  in  the 
United  Nations  to  partition  Palestine.  On  November  29, 
the  United  Nations  General  Assimbly  approved  the  par¬ 
tition  and  the  Arab  states  walked  out,  declaring  that 
the  partitioning  meant  war.  From  this  point  on,  the 
violence  increased  markedly  with  both  Jewish  and  Arab 
mobs  roaming  the  streets  of  Jerusalem,  burning  and 
looting.  By  February,  the  United  Nations  was  attempting 
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to  raise  a  force  to  replace  the  British  in  Pales¬ 
tine. 

Palestine,  however,  was  not  the  only  tumultuous 
spot  on  the  globe.  In  December,  communist-led  strikes 
of  dock  and  utility  workers  in  France  and  Italy  halted 
most  transportation  and  commerce.  Italians  and  Yugo¬ 
slavs  battled  over  elections  in  Trieste.  During  Janu¬ 
ary,  1948,  a  communist  government  and  guerrilla  move¬ 
ment  was  formed  in  Greece  and  attacked  towns  near  the 
Albanian  border.  The  United  States  responded  to  both 
of  these  situations  by  dispatching  1000  marines  to 
waters  near  Trieste  and  Greece.  In  the  far  east.  United 
States  and  Russian  armies  faced  each  other  in  Korea, 
and  Pakistan  sent  4000  troops  against  Indians  in  Kash¬ 
mir. 

Also,  in  January,  the  United  States  succeeded  in 
forming  a  United  Nations  "little  assembly"  which  was 
to  remain  in  session  throughout  the  year;  however, 

Russia  and  the  satellites  boycotted  it.  In  early 
February,  Russia  called  for  debate  in  the  United  Na¬ 
tions  on  the  "slave  labor"  Taft-Hartley  lciw. 

These,  then,  were  the  events  which  produced  the 
messages  measured  in  the  content  analysis.  The  graphs 
of  exposure  by  time  period  follow. 
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Figure  VIII-17.  The  Average  Cumulative  Number  of  Ex¬ 
posures  for  Hales  (+)  and  Females  (• )  to  Theme  2 

(points  in  boxes  are  adjusted  figures) 
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Figure  VIII-18;  The  Average  Cumulative  Number  of 
Exposure  Events  for  Males  (+)  and  Females  (• ) 
to  Theme  3 

(points  in  boxes  are  adjusted  figures) 
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Figure  VIII-22.  -The  Average  Cumulative  Number  of  Expo¬ 
sures  for  Males  (+)  and  Females  (•)  to  Theme  7 
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Figure  VIII-23.  The  Average  Cumulative  Number  of  Expo  r 
sures  for  Males  C+)  and  Females  (•)  to  Theme  8 
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Figure  V1II-24.  The  Average  Cumulative  Number  of 
Exposures  for  Males  {  +  )  and  Females  (•  )  to  Theme.  9 
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Figure  VIII-25.  The  Average  Cumulative  Number  of 
Exposures  for  V:*les  (  +  )  and  Females  (*)  to  Theme  10 
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Figure  VIII-28.  The  Average  Cumulative  Number  ojf 
Exposures  For  College  (♦),  High  School  (®> ,  and 
Grade  School  («)  Education  to  Theme  1 

(points  in  boxes  are  adjusted  figures) 
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Figure  VIII-30.  The  Average  Cumulative  Number  of  Expo¬ 
sures  For  College  (+),  High  School  (©) ,  and  Grade 
School  ( • )  Education  to  Theme  3 

(points  in  boxes  are  adjusted  figures) 
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Figure  VIII-31.  The  Average  Cumulative  Number  of 
Exposures  for  College  (♦),  High  School  (Q),  and 
Grade  School  (•)  Education  to  Theme  4 

(points  in  boxes  are  adjusted  figures) 


Number  of  Exposures  Number  of  Exposures 


VETO  POWER 


•  * 


1  2  3  4  5  6  7  8  9  10  11  12  13 

Time  Period 

igure  VIII-32 .  The  Average  Cumulative  Number  of  Expo 
sures  for  College  (•»•),  High  School  (6 ) ,  and  Grade 

School  (•)  Education  to  Theme  5 
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Figure  7III-33.  The  Average  Cumulative  Number  of  Expo¬ 
sures  For  College  (+),  High  School  (#),  and  Grade 
School  (•)  Education  to  Theme  6 
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Figure  VIII-36.  The  Average  Cumulative  Number  of  Exposures 
For  College  (  +  ),  High  School  (6>),  and  Grade  School  -(# ) 
Education  to  Theme  9 


.cure  VIII-37.  The  Average  Cumulative  Number  of  Exposures 
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Education  to  Theffle  it) 
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Figure  VIII-38.  The  Average  Cumulative  Number  of  Exposures 
For  College  (  +  ),  High  School  «3>) ,  and  Grade  School  (#) 
Education  to  Theme  11 
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Figure  VIII-39.  The  Average  Cumulative  Number  of  Exposures 
For  College  (+),  High  School  (©),  and  Grade  School  (•) 
Education  to  Theme  12 
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Figure  VIII-41.  The  Average  Cumulative  Number  of  Ex¬ 
posures  for  High  (♦),  Middle  (©) ,  and  Low  <•)  SES  to 

Theme  2 

(points  in  boxes  are  adjusted  figures) 
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Figure  VIII-42.  The  Average  Cumulative  Number  of 
Exposures  for  High  (♦),  Middle  (0),  and  Low  (*) 
SES  to  Theme  3 

(points  in  boxes  are  adjusted) 
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Figure  VIII-43.  The  Average  Cumulative  Number  of  Ex¬ 
posures  for  High  (  +  ),  Middle  (O) ,  and  Low  (•)  SES 

to  Theme  4 

(points  in  boxes  are  adjusted  figures) 
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Figure  VIII-45.  The  Average  Cumulative  Number  of  Ex¬ 
posures  for  High  (+),  Middle  ,  and  Low  (#) 

SES  to  Theme  6 
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Figure  VIII-46.  The  Average  Cumulative  Lumber  of  Expo¬ 
sures  for  High  (+) „  Hiddle  (©),  and  Low  (•)  SES 

to  Theme  7 
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Figure  VIII-47.  The  Average  Cumulative  Number  of 
Exposures  for  High  (+) ,  Middle  (0) ,  and  Low  (#) 

SES  to  Theme  8 
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Figure  VIII-48 .  The  Average  Cumulative  Number  of  Ex¬ 
posures  for  High  (  +  ),  Middle  (<&),  and  Low  (*)  SES 

to  Theme  9 


Figure  VIII-49.  The  Average  Cumulative  Number  of  Ex¬ 
posures  for  High  <+),  Middle  (©) ,  and  Low  (•)  SES 

to  Theme  10 
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Figure  VIII-50.  The  Average  Cumulative  Number  of  Ex¬ 
posures  for  High  (+),  Middle  (0),  and  Low  («)  SES 

to  Theme  11 
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Figure  VIII-51.  The  Average  Cumulative  Number  of 
Exposures  for  High  (+),  Middle  (0),  and  Low  (•)  SES 

to  Theme  12 
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In  addition  to  the  face  validity  of  the  internal  logic 
of  the  model  and  the  event  validity  of  some  of  the  assump¬ 
tions  of  the  model  as  tested  against  real  cumulation  data 
(this  is  described  in  the  Appendix  to  Chapter  V) ,  and  the 
plausibility  of  the  records  of  exposures  generated  by  the 
media  system  in  both  the  trial  and  the  real  scenarios*  we 
would  also  like  to  establish  a  more  difficult  validation? 
namely,  the  correlation  of  the  expected  number  of  exposures 
to  various  themes  with  the  changes  found  in  the  NORC  panel 
over  the  six-mcnth  period,  across  several  population  types. 
There  are  several  difficulties  in  attempting  this  kind  of 
correlation.  First,  we  must  define  what  we  mean  by  change 
in  the  panel,  somehow  sorting  out  those  changes  which 
might  reasonably  be  assumed  to  have  resulted  (in  part) 
from  exposure  to  messages  in  the  media  system  from  those 
changes  which  seem  to  be  unrelated  to  exposure  and  are  per¬ 
haps  more  or  less  random  fluctuations  in  public  opinion* 
Second,  it  is  not  at  all  obvious  that  one  can  construct 
some  simple  model  of  change  as  measured  by  survey  data, 
that  can  be  correlated  with,  or  causally  related  to,  expo¬ 
sure  to  messages  in  the  mass  media.  Third,  it  is  quite 
likely  that  exposure  to  the  same  messages  in  ths  mass  media 
causes  different  directions  of  change  in  different  audience 
types  in  the  population,  and  indeed,  in  different  individuals 
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within  the  same  audience  type.  We  shall  not  attempt  to 
account  for  changes  in  different  directions,  upon  exposure 
to  similar  messages. 

In  addition,  we  must  remember  that  the  data  from  which 
the  simulation  model  of  the  media  system  was  constructed 
was  itself  imperfect — often  inaccurate  or  lacking — and  of¬ 
ten  had  to  be  constructed  from  numerous  assumptions  and 
estimations.  Likewise  the  content  analysis  was  not  as 
reliable  as  it  might  have  been  (recall  that  all  the  radio 
messages  were  inferred  from  the  newspaper  messages)  and 
the  generation  of  message  exposure  probabilities  based  on 
format  factors,  etc.,  at  best  explained  only  half  of  the 
variance  in  message  exposure.  It  is  also  true  (as  we  shall 
see  below)  that  the  themes  coded  in  the  content  analysis 
sure  not  always  directly  and  obviously  relevant  to  items  in 
the  NORC  survey;  occasionally  the  themes  which  are  corre¬ 
lated  are  only  marginally  relevant  in  any  obvious  way  to 
the  opinions  or  information  tapped  by  the  questionnaire 
items. 

Finally,  we  shall  use  the  three  dimensions,  sex,  edu¬ 
cations,  and  socio-economic  status,  to  define  the  audience 
types  across  which  the  correlations  are  calculated.  (These 


Thus,  although  the  dimensions  by  which  the  popula¬ 
tion  is  typed  may  explain  a  large  proportion  of  the  vari¬ 
ance  in  audience  exposure,  they  are  not  necessarily  those 
dimensions  which  well  explain  attitude  or  information 
chance.  “ 


dimensions  were  chosen  because  they  produce  the  largest 

differences  in  exposure.}  These  three  dimensions  define 

12 

eighteen  audience  types f  each  of  which  we  will  divide 

into  four  cells  on  the  basis  of  before-and-after  responses 
to  the  questionnaire  items,  resulting  in  a  total  of  seventy 
two  cells.  Recall  that  the  NORC  survey  panel  consisted  of 
approximately  six-hundred  respondents,  which  when  spread 
across  the  seventy-two  cells,  produces  an  average  of  fewer 
than  nine  respondents  per  cell.  Thus,  all  things  consid¬ 
ered,  we  have  no  great  expectation  of  high  correlations 
between  the  exposures  predicted  by  the  simulation  and  the 
changes  found  in  the  survey  panel;  however,  insofar  as 
the  real  media  system  is  reasonably  well  modeled  and  errors 
in  the  construction  of  the  media  system  plus  effects  un¬ 
accounted  for  are  random,  we  should  expect  to  find  regular, 
small  correlations  between  the  exposure  data  and  some  mea¬ 
sure  of  change  in  the  panel. 

With  these  caveats  in  mind,  we  proceed  to  the  correl¬ 
ation  of  exposure  to  themes  in  the  simulation  with  the 
changes  in  information  or  opinion  measured  in  the  NORC 
survey  panel.  We  must  decide  which  questionnaire  items 
appear  to  be  closely  related  to  which  themes  in  the 


Actually  we  shall  use  only  16  of  the  18  cells  in 
the  correlations  because  two  of  the  cells,  males  and  fe¬ 
males  of  college  education  and  low  SSS,  have  only  two  and 
one  members,  respectively,  in  the  simulation  population. 
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scenario.  He  will  attempt  to  show  that  changes  in  re¬ 
sponses  to  the  questionnaire  items  correlate  positively 
with  exposure  to  those  themes  where  we  perceive  a  rela¬ 
tionship  between  the  theme  and  questionnaire  items ,  and 
that  for  other  questionnaire  items  and  other  themes  which 
are  not  related ,  the  correlations  cure  random.  Thus,  we 
test  on  the  one  hand  for  significantly  large  correlations 
where  expected,  and  on  the  other  hand  for  statistically 
insignificant  correlations.  The  first  step,  however,  is 
to  construct  some  index  of  change  of  attitude  or  opinion 
in  the  population. 

Several  Indices  of  Opinion  and  Information  Change 

One  can  think  of  several  possibilities  for  indices 
of  change  applied  to  panel  data.  For  instance,  let  us 
suppose  that  the  item  asks  some  question  about  a  matter 
cf  fact  for  which  there  is  a  definite  correct  answer.  An 
index  might  measure  the  change  within  a  group  of  people 
as  the  difference  between  the  proportion  initially  giving 
the  correct  answer  and  the  proportion  finally  giving  the 
correct  answer.  Thus,  if  sixty  per  cent  give  the  correct 
answer  initially  and  seventy-five  per  cent  give  a  correct 
answer  finally,  then  there  has  been  a  net  positive  change 
of  fifteen  per  cent.  It  is  well  known  that  this  measure 
of  change  has  a  bias,  however,  i.e.,  for  groqps  in  the  pop¬ 
ulation  which  initially  begin  at  a  very  high  proportion  of 
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correct  answers,  the  possibility  for  positive  change,  in 
terms  of  proportion  of  the  group  which  can  make  such  a 
change,  is  limited  by  the  initially  high  proportion  of 
correct  answers.  This  is  the  ceiling  effect,  which  is 
commonly  a  problem  in  measuring  such  political  phenomena 
as  conversion  or  change  of  opinion  in  a  political  cam-* 
paign. 

One  way  to  avoid  this  ceiling  effect  is  to  normalize 
the  measure ,  i . e . ,  divide  the  proportional  increase  by  the 
maximum  possible  proportional  increase  from  the  first  to 
the  second  time.  If  we  let  and  P2  represent  the  pro¬ 
portions  giving  the  correct  answer  in  September  and  in 
March,  respectively,  then  this  first  index  of  change  is 
just 


P1 


We  can  calculate  the  value  of  Index^  from  the  entries 
in  a  fourfold  table  describing  the  pattern  of  response  to 
an  item,  initially  and  finally,  in  any  group  or  audience 
type.  For  the  table  in  Figure  VIII-52,  the  initial  pro¬ 
portion  correct  is  P^  «  (a  +  c)/N  and  the  final  proportion 
is  P2  *  (a  +  b)/N. 


-  -»  rpf  KP'-  - 
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Figure  VIII-52.  A  Pattern  of  Possible  Responses  to  an 

Item  at  Two  Times 


Thus  Index^  is  given  by 


Index, 


a  +  b  a  +  c 
N  N 

i  a  +  c 
X  N 

b  -  c 
N 

N  -  (ate) 

N 


Index^  =  b  +~<i 


(VIII-1) 


A  second  possible  index  of  conversion  or  change  is 
formed  in  the  following  ways  we  calculate  the  proportion 
of  those  giving  the  wrong  answer  in  September  who  give 
the  correct  answer  in  March  and  subtract  from  this  pro¬ 
portion  the  proportion  who  seem  to  be  guessing.  In  terms 
of  the  fourfold  table  the  proportion  of  those  wrong  in 
September  who  are  correct  in  March  is  b/(b  +  d) ,  and  the 
proportion  guessing  (the  proportion  of  those  who  were 


correct  in  September  and  turong  in  March)  is  estimated  as 
c/(a  +  c) .  The  second  index  is  then 

,  b  b  c 

Index2  =  ^  ’+~ff  *  a"  4.  ~q 

or,  simplified,  Inde*2  -  •  J-f-J  (VIM-2) . 

A  third  index  can  be  calculated,  incorporating  the 
notion  of  guessing,  but  extending  it  by  recognizing  that 
even  among  those  people  who  gave  correct  answers  in  both 
September  and  March,  a  small  proportion  were  probably 
guessxng.  Let  w  represent  the  true  number  who  know  the 
correct  answer  in  September  (and  assume  that  they  »re  re¬ 
corded  correctly  in  the  survey) ,  and  let  x  represent  the 
true  number  who  do  not  know  in  September  but  who  guess 
with  an  average  probability  p  of  guessing  correctly. 

Also  let  y  and  z  represent  the  true  numbers  in  those  two 
groups  in  March.  If  we  make  the  assumption  that  no  one 
who  truly  knows  in  September  forgets  or  "unlearns"  in  the 
interval  from  September  to  March,  then  the  true  number 
who  learn  is  just  y-w  and  the  proportion  of  those  who  did 
not  know  the  correct  answer  in  September,  who  have  learned 
in  the  interval,  is  (y-w)/x.  This  is  the  third  conversion 
or  change  index  which  must  be  expressed  in  terms  of  the 
observed  quantities  a,  b,  c,  and  d. 

The  observed  number  correct  in  September  is  just 
(a  +  c) ,  which  equals  the  number  truly  knowing  in 


September  (w)  plus  the  proportion  who  do  not  know  but 
guess  correctly  (p.x) .  Thus 


?  <  Sit1?5'*!!*??!#'9'* 
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Substituting  equation  (VIII-9)  in  equation  (VIII-8)  gives 


(b  -  c) 

y-w*=b-c  +  P(|~— pj 


b  -  c 
1  -  P 


(VIII-10) . 


From  equation  (VIII-7) 


(1  -  P> 


and  substituting  for  z  in  equation  (VIII-6) 


c  +  d  *  (1  -  p)z 


d 

1  “  P 


and 


1  “  P 
P 


d 

C  +  d 


c 

c  +  d 


(VIII-X1) 
(VIIi-12) . 


Substituting  1  -  p  of  equation  (VIII-11)  in  equation 
(VIII-10)  we  find  that  the  real  number  who  have  learned 
from  September  to  March  is 


x  -  z 


(b  -  s)  (c  +  d) 

a 


=  (b  -  c)  (1  +  |) 


(VIII-13) . 


Thus  the  best  estimate  of  the  number  learning  from  Sep¬ 
tember  to  March  is  slightly  larger  than  the  first  obvious 
estimate,  (b  -  c) .  We  must  next  solve  the  equations  for 
x,  the  number  not  knowing  in  September.  Rearranging 
equation  (VIII-4) 


x(l  -  p)  *  b  ♦  d 
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or 


x  = 


b  +  d 

r^p 


and  substituting  equation  (VIII -11)  for  the  term  (1  -  p) , 

(VIII-14) . 


(b  +  d)  (c  +  d) 

a 


Thus  the  third  change  index  becomes  (from  equations 
(VIII-13  and  VIII-14)) 

Index^  =>  3L-Z-E 

_  (b  -  c)  (c  +  d) _ d _ 

cl  (b  +  d)  (c  +  d) 


or 


Index.  = 


b  -  c 

E -TS 


Therefore,  both  Index^  from  the  first  single  model,  and 
Index ^  derived  from  the  more  sophisticated  model,  have 
the  same  values  for  any  fourfold  table.  Table  Vlll-17 
shows  values  of  the  three  indices  for  several  distribu¬ 
tions  of  responses.  For  example,  the  first  row  shows  the 
case  of  maximum  change,  i.e.,  all  the  initial  responses 
were  incorrect  and  all  the  final  responses  were  correct. 

For  this  case,  a,  c,  and  d  are  zero;  Index^  and  Index^ 
equal  1.0  but  Index2  is  undefined  although  it  could 
logically  be  set  equal  to  1.0  at  this  limit.  In  general, 
the  three  indices  do  not  differ  much,  especially  if  the 
ratio  c/b  is  close  to  1.0.  Because  of  the  appeal  of  the 
model  which  underlies  Index^ ,  we  have  used  this  index  for 
measuring  change  in  the  survey  panel.  The  next  task,  then. 


^£38S2&&Z?&** sst-srurSP  <£5*!*?**  -  ^ 
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is  to  express  the  survey  date  in  the  form  of  fourfold 
tables  to  which  the  index  can  be  applied. 


Table  VIXI-17.  Several  Values  of  Three  Indices  of  Change 

in  Fpurfold  Tables  for  Certain  Special 
Cases 


t 

! 

1 

Values  of 

Indices 

f  Amount  of  Change 

Index^,  Index^ 

Xndex2 

\ 

i 

(M 

[b  +  d)  (a  +  c) 

|  Maximum  change 

|  a,  c,  df  s  0 

1.0 

undefined 

!  One-half  of  those 

initially  wrong  ohange; 
no  guessing  c  »  Oj  b  =  d 

.5 

.5 

One-half  of  those 
initially  wrong  change, 
but  one  half  guess  c  *=  a, 
b  *  d 

.25 

All  those  initially 
wroiig  change,  but  all 
guess  a  =  0,  d  =  0 

1.0  -  | 

0.0 

No  change  at  all 
c  =  0,  b  =  0 

0.0 

0.0 

The  Preparation  of  the  Survey  Data:  The  Problem  of  the 
"Correct"  answer. 


The  index  of  change  described  above  assumes  that  the 
data  can  be  put  in  the  form  of  a  fourfold  table  in  which 
one  of  the  two  possible  answers  to  the  item  is  a  correct 
answer,  in  the  sense  that  it  is  an  answer  toward  which 
information  or  attitudes  are  changing.  If  the  question 
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is  indeed  a  factual  question ,  and  only  the  correct  answer 
may.  be  presumed  to  be  abroad  in  the  mass  media,  then  the 
model  applies  relatively  directly.  For  example.  Table 
VIII-18  below  shows  the  September  and  March  responses  to 
a  factual  question  asking  whether  it  was  the  job  of  the 
United  Nations  to  protect  Human  Rights.  After  eliminat¬ 
ing  respondents  for  whom  one  or  both  of  the  answers  was 
not  ascertainable,  the  answers  to  this  factual  question 
fall  into  the  three  categories,  "Yes,"  "No,"  and  "Don't 
Know."  Table  VI2I-18  shows  that  there  was  a  slight  in¬ 
crease  (from  79.5  to  83.3  percent)  for  the  correct  answer 
("Yes")  and  corresponding  decreases  in  the  "No"  and  "Don't 
Knew"  responses.  In  this  case,  we  have  created  a  fourfold 
Table  VIII-1S  below  from  the  ninefold  table  by  combining 
into  one  category  the  "No"  and  "Don't  Know"  responses. 

Thus  in  all  the  items  relating  to  factual  matters,  the 
"Don't  Know"  responses,  if  any,  were  combined  with  the 
incorrect  responses  to  reduce  the  responses  to  a  dichot¬ 
omy. 

The  situation  is  less  clear  when  we  turn  to  matters 
of  attitude  or  opinion.  Do  the  "Don't  Know"  responses 
actually  reflect  the  respondent's  lack  of  information  or 
interest  or  do  they  represent  a  position  on  a  continuum 
somewhere  between  a  "Yes"  response  and  a  "No”  response. 


i.e.,  inability  of  the  respondent  to  decide?  In  these 
cases  we  resolve  the  matter  in  the  following  manner:  In 
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Table  VIII-18.  A  Ninefold  Table  of  Responses  Concerning  the 
United  Nations  and  Human  Rights 


September  Responses 

March  Responses 

Yes 

NO 

D.K. 

Marginals3 

Yes 

85.8% 

67.8% 

92.9% 

83.3% 

No 

10.7 

28.2 

7.1 

13.4 

D.K. 

3.4 

7.1 

0.0 

3.4 

100.0% 

100.0% 

100.0% 

(1977) 

(397) 

(112) 

Marginals3 

79.5% 

16.0% 

4.5% 

100.0% 

(N-2486) 

^he  row  and  column  marginals  are  based  on  the  total  number 
of  weighted  responses  (N=2486) .  However  the  total  number 
of  respondents  in  the  panel  was  592  of  which  thirty  percent 
were  filtered  out  prior  to  this  question  because  they  could 
not  identify  the  United  Nations. 


Table  VIII-19.  A  Fourfold  Table  of  Responses  Concerning  the 

United  Nations  and  Human  Rights 


September  Responses 


March  Responses 

Yes 

No-D.K. 

Marginals3 

Yes 

85.8% 

73.3% 

83.3% 

No -D.K. 

14.2 

26.7 

16.7 

100.0% 

100.0% 

(1977) 

(509) 

Marginals3 

79.5% 

20.5% 

100.0% 

(N*2486) 

aThe  row  and  column  marginals  cure  based  on  the  total  number 
of  weighted  responses  (N*2486) . 


> 
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the  ninefold  table  we  compare  the  marginal  responses  for 
September  and  March  to  determine  which  of  the  responses , 

"Yea"  or  "No,"  has  increased  in  frequency  during  the  in¬ 
terval  and  call  this  response  the  "correct”  response. 

Thus  we  assume  that  public  opinion  encouraged  by  the  media 

I 

output,  is  shifting  in  the  "correct"  direction.  Next  we 
note  the  direction  of  change  of  the  "Don't  Know's.”  If 
the  "Don't  Know's"  also  increase  during  tne  interval  then 
they  are  combined  with  the  correct  response;  if  they  de¬ 
crease  from  September  to  March  then  they  are  combined 
with  the  incorrect  response. 

As  an  example,  suppose  that  the  percentage  of  people 
expecting  war  increased  from  30  to  60  percent  during  the 
six  months.  If  the  percentage  of  "Don't  Know' 3"  also 
increased  isay)  from  10  to  15  percent,  while  the  percen¬ 
tage  not  expecting  war  decreased  from  60  to  25  percent, 
we  make  the  most  plausible  assumption  that  the  "Don't 
Know's"  represent  people  responding  to  the  same  forces 
which  seem  to  be  increasing  the  expectation  of  war.  Thus, 
they  are  combined  with  the  category  expecting  war. 

This  solution  increases  the  amount  of  change  in  the 
table  over  that  which  one  would  find  if  one  simply  elimin¬ 
ated  all  the  "Don't  Know"  responses.  Of  course,  some  of 
the  questionnaire  items  seemed  to  have  responses  which 
fall  naturally  into  a  dichotomy,  e.g.,  a  question  was 
asked  about  several  subjects  to  determine  whether  or  not 
the  respondent  had  a  "keen"  interest  in  the  subject. 


In  addition  to  those  items  for  which  an  index  of 
change  has  been  calculated  there  were  also  several  items 
or  indices  which  could  be  directly  correlated  with  cer¬ 
tain  themes  in  the  simulation.  For  example,  one  of  the 
indices  formed  was  an  index  of  change  in  information  over 
the  six-month  period  from  several  factual  items  relating 
to  international  affairs.  This  index  can  be  directly 
correlated  with  exposure  to  themes  in  the  scenario.  Also, 
several  of  the  March  questions  asked  about  the  respond¬ 
ent's  exposure  to  news  about  the  United  Nations  during 
the  previous  six  months;  for  these  questions  we  have 
coded  the  number  and  kinds  of  vehicles  mentioned  by  the 
respondents  as  exposure  vehicles  in  the  previous  six 
months.  These  items  are  then  directly  correlated  with 
exposure  to  the  scenario  themes.  In  Table  VXII-20  we 
have  listed  the  variables  measured  by  the  NORC  survey 
which  have  been  used  in  the  correlations  with  exposure 
and  the  net  changes  in  these  variables. 

How  is  Exposure  to  Themes  Related  to  Conversion  or  Ohange? 

whet  index  of  exposure  shall  we  attempt  to  correlate 
with  recall  of  exposure  ci:  with  the  index  of  change  in 
the  panel?  Given  that  we  are  running  the  correlations 
over  sixteen  population  subgroups,  an  obvious  candidate 
for  the  index  of  exposure  is  the  average  expected  number 
of  exposures  for  the  population  subgroup .  This  first 


Table  VIII-20.  NORC  Panel  Measures  Showing  Attitude  and  Information  Change  and 

Mass  Media  Exposure  over  the  Six  Months 
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news  about  relations 
with  Russia? 
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model  postulates  that  the  amount  of  change  shown  in  a 
subgroup  is  a  linear  function  of  the  average  number  of 
exposures  to  a  relevant  theme  by  members  of  that  sub¬ 
group  .  This  is  the  first  model  actually  tested  with  the 
data;  however/  this  model  ignores  the  likely  presence  in 
the  subgroup  of  people  who  initially  know  the  correct 
respon-  These  people,  according  to  present  communica¬ 
tions  r  ..ojLrch,  are  likely  to  be  more  exposed  than  those 
initially  not  knowing  and  therefore  the  average  number 
of  exposures  for  the  informed  and  the  uninformed  should 
differ.  A  second  model  posits  a  linear  relationship  be¬ 
tween  the  average  number  of  exposures  of  those  initially 
uninformed  in  the  subgroup  and  the  index  of  change  for 
the  subgroup. 

We  have  experimented  with  this  second  model  as  well 
as  with  the  first,  estimating  the  number  uninformed  in  a 
subgroup  as  equal  to  the  number  initially  giving  the 
wrong  response,  and  assuming  that  the  exposure  rate  of 
the  informed  is  1.5  or  2.5  times  that  of  the  uninformed. 

Finally  an  even  more  realistic  model  of  change  would 
posit  some  threshold  of  exposure  necessary  for  a  person  to 
learn  and  report  learning  to  an  interviewer.  With  this 
assumption,  we  explore  below  four  levels  of  complexity  of 
this  model. 
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Models  of  Conversion  or  Change  Within  Groups 

The  model  of  change  suggested  above  involves  two  con¬ 
ceptual  elements  which  together  combine  to  describe  the 
number  of  people  changing  as  a  function  of  the  average 
number  of  exposures.  The  first  of  these  elements  is  the 
range  of  thresholds  of  conversion  present  in  the  group  of 
people  initially  not  knowing.  This  range  comes  about 
either  because  at  the  onset  of  the  simulation  th  hypo¬ 
thetical  people  are  assumed  to  have  different  initial  ex¬ 
posure  levels,  and/or  because  o.  intrinsic  differences  in 
thresholds  due  to  the  present  attitude  structure,  cogni¬ 
tions,  etc.  In  general  we  will  find  a  distribution  of 
those  not  knowing  along  a  range  of  thresholds.  This  dis¬ 
tribution  might  be  a  constant  density  of  individuals,  or 
it  might  bell-shaped,  or  might  assume  any  one  of  a  number 
of  different  shapes.  The  second  element  is  the  net  ex¬ 
posure  probability  of  the  individuals  to  the  messages  in 
the  theme,  which  governs  their  individual  rates  of  growth 
of  expected  exposures.  Let  us  explore  how  these  elements 
combine  to  imply  several  different  kinds  of  relationships 
between  different  changes  or  conversion  indices  and  aver¬ 
age  number  of  exposures.  We  explore  below  several  cases 
using  hypothetical  values  and  relationships  between  the 
two  elements. 

Case  One.  Suppose  that  each  individual  has  the  same 
threshold  number  cf  exposures  for  a  change  to  occur,  and 
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in  addition,  that  all  persons  have  identical  net  exposure 
probabilities  for  each  message  in  the  theme.  If  this  is 
true,  then  the  expected  number  of  exposures  for  each  in¬ 
dividual,  which  is  just  the  sum  of  the  net  exposure  prob¬ 
abilities  for  the  individual,  all  grow  at  exactly  the 
same  rate  and  the  average  expected  number  of  exposures 
for  all  those  not  knowing  is  just  the  actual  value  of  the 
expected  number  of  exposures  for  each  individual  in  that 
group.  If  this  is  the  case,  then  all  the  expected  number 
of  exposures  grow  together  toward  the  threshold  and  all 
of  them  pass  the  threshold  at  the  same  time;  we  find  no 
change  at  all  prior  to  that  message  which  makes  each 
person's  expected  number  of  exposures  greater  than  or 
equal  to  the  threshold,  and  one  hundred  percent  change  at 
that  particular  message.  Thus  would  expect,  in  this 
case,  to  fine  that  the  change  ir.uex  has  only  two  possible 
values,  either  zero  or  one  (eliminating  the  possibility 
of  guessing,  of  course) ,  and  the  relationship  between 
average  exposure  and  change  is  a  step  function.  The 
correlation  between  the  change  index  and  the  average 
number  of  exposures  would  be  quite  small  if  the  average 
number  of  exposures  is  spread  out  over  a  significant 
range  for  the  cells  over  which  we  are  correlating. 

Case  Two.  In  this  case  let  us  assume  that  there  is 
a  range  of  thresholds,  that  the  individuals  not  knowing 
are  distributed  uniformly  over  this  range,  but  as  in  the 
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previous  case  the  net  exposure  probabilities  are  all 
equal  over  all  the  hypothetical  persons  not  knowing.  In 
this  case,  all  the  expected  number  of  exposures  grow  to¬ 
gether  toward  the  lowest  value  of  the  threshold  vange  but 
no  change  occurs  until  the  person  with  the  lowest  thres¬ 
hold  of  change  has  an  expected  number  of  exposures  equal 
to  that  threshold.  Thus,  the  change  index  is  zero  up 
to  this  point.  Once  we  have  reached  this  point,  the  number 
of  people  changing  will  be  proportional  to  the  increment 
in  expected  number  of  exposures  so  long  as  the  total  ex¬ 
pected  number  of  exposures  for  the  individual  does  not 
exceed  the  high  end  of  the  range.  If  it  exceeds  the  range, 
then  all  the  people  have  crossed  the  threshold  and  there 
is  no  change  even  though  the  number  of  exposures  contin¬ 
ues  to  grow.  (There  is  also  the  possibility  that  the 
range  is  so  small  that  the  next  message  after  one  enters 
the  range,  takes  one  out  of  the  range ,  or  in  fact  that 
one  jumps  completely  over  the  range,  in  which  case  we  have 
a  step  function  again.)  Thus,  in  this  case,  we  expect 
to  have  a  linear  increase  in  the  change  index  with  a  lin¬ 
ear  increase  in  expected  number  of  exposures,  from  the 
time  one  enters  the  threshold  until  the  time  one  passes 
out  of  the  threshold .  Thus  the  correlation  between  the 
average  number  of  exposures  and  the  change  index  should 
be  1.0  within  the  range  of  thresholds  but  for  exposures 
distributed  much  more  widely  than  the  range  of  thresholds 
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the  correlation  should  be  smaller,  and  the  more  widely 
the  exposures  are  distributed  the  smaller  the  correla¬ 
tion. 

Case  Three.  Another  possibility  is  that  the  thres¬ 
holds  are  all  the  same  for  all  individuals,  as  in  the 
first  case,  but  that  the  net  message  exposure  probabil¬ 
ities  differ  from  individual  to  individual,  from  zero  to 
one,  and  that  in  this  range  the  not-knowing  group  are 
uniformly  distributed.  (This  range  of  net  exposure  prob¬ 
ability  need  not  be  from  zero  to  one  in  this  example ,  but 

/ 

this  simplifies  matters.)  In  chis  case,  as  a  succession 
of  messages  is  processed  in  the  simulation,  the  individ¬ 
uals  with  the  highest  net  message  exposure  probabilities 
grow  quickly  to  the  threshold  and  pass  it,  and  the  people 
with  lower  net  message  exposure  probabilities  grow  more 
slowly.  The  net  result  is  that  as  messages  sure  processed 
there  is  a  constant  rate  of  conversion  of  individuals  to 
the  message,  and  a  constant  increase  in  the  change  index 
from  zero  to  oze  with  a  constant  increase  in  expected 
number  of  exposures,  until  the  last  person  is  converted. 

Case  Four.  In  the  most  coxnplicated  case,  the  unin¬ 
formed  members  of  the  group  are  not  uniformly  distributed 
over  the  threshold  range,  but  distributed  in  a  normal 
curve  or  some  other  distribution.  The  net  message  expo¬ 
sure  probabilities  for  the  members  are  not  equal,  but 
differ  from  individual  to  individual,  and  as  such  we  have 
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a  distribution  of  net  exposure  probabilities.  In  this 
case  as  messages  are  processed/  there  are  starts  and 
jerks  as  people  cross  the  threshold  level  of  exposure. 

The  resulting  growth  of  converted  people  is  certainly 
not  linear  with  growth  in  average  exposure,  and  in  this 
most  realistic  case,  the  maximum  correlation  would  cer¬ 
tainly  not  be  one. 

Since  we  have  no  data  on  exposure  thresholds  for 
the  simulated  population  and  since  the  simulation  does 
not  store  individual  exposure  rates,  but  only  subgroup 
averages,  we  have  not  attempted  to  apply  the  four  more 
elaborate  models  outlined  above.  The  most  realistic  case 
does  however,  suggest  that  the  relationship  between  aver¬ 
age  exposure  and  conversion  in  a  group  can  be  far  from 
linear;  thus  we  have  additional  grounds  for  expecting 
smaller  correlations.  In  the  next  pages  we  look  at  the 
correlations  resulting  from  the  models  directly  relating 
average  exposures  for  the  subgroup  or  an  estimate  of 
average  ex,  usures  for  the  uninformed  in  the  subgroup  to 
conversion  or  change . 


The  Effect  of  the  Exposure  Model  On  the  Correlations  Be¬ 
tween  Exposures  and  Changes  in  the  Panel 

In  order  to  examire  the  consequences  of  the  model 


transforming  average  raw  exposures  for  a  subgroup,  we 
have  collected  together  in  Table  VIII-21  several  correla¬ 
tions  of  changes  in  the  panel  over  the  six  months  or 
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'individual  transformed  average  exposures  are  oreated  for  eaoh  thane,  for  eaeh  of  the  panel-measured  variables  for  which  a  conver 
a ion  index  is  calculated.  "Average*  transformed  average  exposures  are  oreated  for  each  theme  for  the  entire  set  of  these  panel- 
measured  variables,  based  on  the  average  number  of  knowledgeable  and  leas  knowledgeable  persons  in  each  population  subgroup  over 
the  entire  set  of  these  variables. 
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recollections  of  exposure  by  the  panel  for  the  six  months 
with  the  tenth  theme  of  the  scenario#  explicit  explanation 
of  the  United  Nations.  We  chose  this  theme  because  the 
message  content  seemed  particularly  clear  and  definite 
and  we  might  expect  high  correlations  with  changes  in 
attitudes  or  information  about  the  U.N.,  and  also  because 
the  moderate  number  of  messages  in  this  theme  and  the  re¬ 
sulting  moderate  overall  average  exposure  produced  negli¬ 
gible  errors  in  recording  average  exposures.  The  error 
for  this  theme  was  only  5.1  exposures  in  a  total  of  3400 
•-  xposures . 

First  we  look  at  the  correlations  of  raw  average  ex¬ 
posure  with  several  of  the  variables  measured  by  the  NORC 
survey.*3  These  variables  include  the  level  of  informa¬ 
tion  change  over  the  six  month  period  I measured  by  the 
ability  to  answer  five  factual  questions  about  the  United 
Nations) ,  high  information  change  being  a  net  improvement 
of  from  three  to  five  factual  answers  and  medium  informa¬ 
tion  change  being  net  improvement  of  from  one  or  two  an¬ 
swers.  Two  more  variables  measure  recall  of  exposure  to 
information  about  the  United  Nations  via  the  newspapers 
or  radio.  The  next  two  variables  are  increased  knowledge 

*3A1 though  we  have  average  exposures  by  each  time 
period  for  a  total  of  thirteen  points  at  which  the  average 
exposure  is  measured#  we  have  calculated  the  correlations 
only  for  every  other  time  period  beginning  with  the  first, 
and  ending  with  the  thirteenth  in  order  to  simplify  the 
presentation  of  the  data. 


425 


of  the  main  function  of  the  United  Nations/  i.e.,  peace¬ 
keeping  and  acquaintanceship  wi _h  the  veto  power.  The 
next  variables measure  decreasing  satisfaction  with  the 
progress  of  the  United  Nations,  the  increasing  proportion 
over  the  six  months  expressing  keen  interest  in  the 
United  Nations,  and  the  proportion  at  the  end  of  the  six 
months  who  say  they  have  more  interest  in  the  United  Na¬ 
tions  than  they  did  six  months  prior.  The  final  variable 
is  the  increased  expectation  of  war  within  the  next  ten 
years . 

We  noted  in  our  discussion  above  of  the  model  relat¬ 
ing  average  exposure  and  conversion,  that  if  we  do  not 
reach  saturation  as  people  are  exposed  to  the  messages 
of  a  theme,  we  might  expect  that  the  correlation  between 
average  exposures  and  the  changes  measured  after  six 
months  would  grow  stronger  as  the  time  periods  in  the 
simulation  progress.  In  other  words,  unless  there  is 
some  sort  of  a  saturation  effect,  we  should  expect  that 
the  relationship  between  the  average  exposures  and  changes 
found  in  the  panel  would  be  strongest  after  all  the  mes¬ 
sages  of  the  theme  are  run  rather  than  at  some  interme¬ 
diate  point  when  only  some  part  of  the  messages  of  the 
theme  had  been  run.  However,  the  data  of  Table  VIII-21 
indicate  that  there  is  some  saturation  or  more  compli¬ 
cated  effect  than  the  simple  relationship  between  average 
exposure  and  change .  None  of  the  ten  dependent  variables 


sh  '.-vs  a  consistent  pattern  of  increasing  correlations 
as  the  time  periods  progress;  in  general  the  correlations 
fluctuate  somewhat  from  time  period  to  time  period  about 
an  average  value.  In  fact,  it  does  not  seem  that  one 
time  period  offers  a  more  respresentative  correlation 
than  any  other  time  period  and  that  we  could  have  rim 
the  scenario  through  only  one  time  period  and  gotten  cor¬ 
relations  with  any  one  of  the  ten  dependent  variables  of 
about  the  same  magnitude  and  direction  as  we  observe  at 
the  end  of  thirteen  time  periods.  If  the  content  analy¬ 
sis  and  the  model  of  the  mass  media  distribution  system 
are:  valid,  then  we  must  conclude  that  there  is  a  more 
complicated  relation  (e.g.,  a  threshhold  effect)  between 
average  exposure  to  a  theme  and  opinion  and  information 
changes  relevant  to  that  theme  over  a  six  month  period. 

The  changes  measured  by  the  NORC  survey  cannot  be  linear 

functions  of  the  average  number  of  exposures  for  these 
14 

subgroups . 

Another  fact  which  probably  diminishes  the  strength 
of  the  correlations  is  that  none  of  the  dependent  vari¬ 
ables  are  directly  related  to  explicit  explanation  of 
the  United  Nations,  the  closest  being  the  increased 

14 

Another  reason  for  the  fact  that  the  magnitude  of 
the  correlation  does  not  grow  as  the  time  period  increases, 
may  be  that  there  is  a  complicated  process  of  assimilat¬ 
ing  and/or  integrating  the  content  of  the  messages  over 
the  six  month  period. 
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knowledge  of  the  main  function  (peacekeeping)  of  the 
United  Nations.  The  most  direct  measure  with  which  we 
could  correlate  exposure  to  this  theme  would  be  a  ques¬ 
tion  asking  people  if  they  recall  listening  to  a  discus¬ 
sion  or  reading  an  article  which  helped  to  explain  the 
purpose  and  functioning  of  the  United  Nations.  Unfor¬ 
tunately  we  have  no  such  question  in  the  NORC  data,  and 
this  recall  would  probed) ly  be  quite  unreliable  even  if 
we  did.  In  addition,  it  may  very  well  be  that  those 
people  who  would  read  or  listen  to  messages  offering  an 
explicit  explanation  of  the  United  Nations  are  those  who 
already  are  quite  familiar  with  it.  If  this  is  true  we 
would  not  expect  a  high  correlation  between  changes  in 
knowledge  and  exposure  to  these  messages. 

On  page  417  above  we  proposed  a  transformation  of 
the  raw  average  exposure  to  the  theme  in  order  to  account 
for  the  fact  that  those  people  who  initially  had  the 
prevailing  knowledge  or  opinion  would  probably  have  a 
higher  likelihood  of  being  exposed  to  the  theme  while 
those  with  less  knowledge  would  have  a  lower  likelihood 
of  exposure.  Since  those  people  with  less  initial  know¬ 
ledge  account  for  the  changes  measured  in  the  panel, 
we  should  correct  the  raw  average  exposure  of  the  sub¬ 
group  or  population  type  to  produce  the  average  exposure 
only  if  those  people  who  might  be  susceptible  to  change 
over  the  six  month  periods,  i.e.,  those  initially  not 
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knowing  the  information  or  not  having  the  opinion  towards 
which  other  people  were  changing.  These  data  transform¬ 
ations  were  performed  as  described  above  for  each  of  the 
dependent  variables  for  which  there  is  a  conversion  index. 
The  resulting  averages,  called  individual  transformed 
average  exposures,  have  been  calculated  assuming  that 
those  already  knowing  have  either  one  and  a  half  times 
the  average  exposure  of  those  not  knowing  (K  =  1.5)  or 
that  they  have  two  and  a  half  times  the  average  exposure 
of  those  not  knowing  (K  =  2.5).  The  next  two  groups  of 
correlations  in  Table  VIII-21  show  the  resulting  correl¬ 
ations  with  these  individual  transformed  average  expo¬ 
sures  for  each  of  the  two  values  of  K.  We  show  these 
correlations  for  only  five  of  the  dependent  variables 
because  the  other  variables  are  measured  only  once  (at 
the  end  of  tne  six  month  period) ;  for  these  variables 
there  is  no  way  to  estimate  the  initial  number  of  people 
not  knowledgeable  nor  having  the  "correct"  opinion. 

These  five  correlations  sure  not  generally  stronger  than 
the  correlations  with  the  raw  average  exposures.  There 
appears  to  be  no  large  change  in  the  magnitude  or  sign 
of  the  correlation  either  with  K  equal  to  1.5  or  with  K 
equal  to  2.5. 

If  the  model  with  the  transformations  were  signifi¬ 
cantly  better  them  the  raw  average  exposure  model,  then 
we  would  expect  to  find  marked  increases  in  the  magnitude 
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of  the  correlations  for  the  dependent  variables  which  are 
most  closely  related  to  the  theme.  The  two  variables 
measuring  increased  knowledge  of  the  function  of  the 
United  Nations  and  acquaintanceship  with  the  veto  power 
are  those  variables  which  seem  most  strongly  related  to 
the  theme;  for  the  first  of  these  variables  the  correl¬ 
ations  does  become  somewhat  stronger  but  never  much 
stronger  than  the  correlation  with  the  raw  average  ex¬ 
posures  above.  For  the  second  variable  the  correl¬ 

ation  becomes  weaker.  This  pattern  of  little  change  in 
the  correlations  as  a  result  of  the  transformations  is 
also  true  for  the  other  themes  and  time  periods.  Thus, 
the  model  does  not  seem  to  add  much  to  the  explanation 
of  the  changes  in  the  Cincinnati  panel  over  the  six 
months. 

In  a  final  attempt  to  improve  upon  the  transforma¬ 
tion  model,  we  have  calculated,  over  the  twenty  panel 
variables  for  which  there  cure  measurements  both  at  the 
beginning  and  at  the  end  of  the  six  months,  an  average 
number  of  less  knowledgeable  people  for  each  of  the  pop¬ 
ulation  subgroups.  The  resulting  transformed  average 
exposures (which  we  have  called  average  transformed  aver¬ 
age  exposures)  have  also  been  correlated  with  the  panel 
variables  for  K  equal  to  1.5  and  2.5.  These  correlations 
are  shown  for  seven  time  periods  for  each  of  the  depend¬ 
ent  variables  in  the  table.  Again,  we  find  no  consistent 
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increase  in  the  strength  of  the  relationship  as  the  time 
periods  progress  and  no  consistent  increase  in  the 
strength  of  the  relationship  when  we  compare  the  average 
transformed  average  exposure  model  and  the  raw  average 
exposure  model. 

We  conclude  that  the  correlation  of  the  raw  average 
exposure  with  the  panel  variables  is  about  as  good  as 
with  the  transformed  exposures.  Also  the  correlations 
at  any  time  period  are  approximately  equal  to  the  correl¬ 
ations  at  any  other  time  period.  Therefore,  in  Table 
VIII-22  below  for  each  theme  we  show  the  correlations 
of  raw  average  exposures  with  each  of  the  panel  variables 
for  just  one  of  the  time  periods.  Usually  this  is  the 
thirteenth  time  period,  but  when  these  data  are  not 
available  (some  data  were  lost  due  to  computer  and/or 
programming  errors)  or  when  they  cure  inaccurate  (due  to 
the  sizeable  cummulation  errors  in  themes  1,  2,  3,  and 
4) ,  we  have  felt  justified  by  these  findings  in  present¬ 
ing  correlations  for  other  time  periods.  We  turn  now 
to  the  presentation  and  analysis  of  these  data  for  each 
of  the  twelve  themes  and  thirty-one  panel  variables. 

The  Correlation  Between  Exposures  to  the  Themes  and  the 
Variables  From  the  NORC  Panel 

Table  VIII-22  shows  the  correlations  over  sixteen 
population  subgroups,  between  the  raw  average  expected 
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number  of  exposures  for  each  of  the  twelve  scenario 
themes  and  certain  information  and  opinion  changes  and 
recall  of  exposure  variables  measured  in  the  Cincinnati 
panel  over  the  six  months  by  the  NORC  survey.  The  most 
startling  and  impressive  feature  of  the  table  is  that 
the  largest  variation  in  the  correlations  is  across  the 
panel  variables  rather  than  across  themes.  We  might  ex¬ 
pect  that  the  panel's  increasing  awarness  of  problems  in 
dealing  with  the  U.S.S.R.  would  correlate  most  highly 
with  the  second  theme  (US-USSR  hostility) ,  or  at  least 
that  the  correlation  would  vary  with  the  theme,  but  in 
fact  the  correlation  is  nearly  constant  over  the  twelve 
themes. 

What  can  account  for  this  pattern  of  correlatic  ? 
We  know  that  the  pattern  of  the  average  number  of  expec¬ 
ted  exposures  across  the  sixteen  population  subgroups  is 
quite  similar  for  each  of  the  themes  in  the  scenario. ^ 
The  main  differences  between  the  exposures  to  the  themes 
is  in  the  overall  average  level  of  exposure,  which  de¬ 
pends  upon  the  number  of  messages  associated  with  the 
theme  rather  than  interest  in  a  particular  theme. 


In  Table  VIII-12  above  we  noted  that  males  of  high 
school  or  grade  school  education  and  high  socio-economic 
status  and  females  of  grade  school  education  and  low 
socio-economic  status  were  the  most  and  least  exposed 
subgroups  respectively  for  each  of  the  twelve  themes. 
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However,  those  population  subgroups  which  are  likely  to 
be  exposed  relatively  highly  to  one  theme  are  likely  to 
be  exposed  relatively  highly  to  every  other  theme  in  the 
scenario.  Thus,  either  the  messages  in  the  media  ve¬ 
hicles  are  distributed  in  much  the  same  way  from  theme 
to  theme  or  enough  duplication  exists  between  the  vehicle 
audiences  so  that  the  messages  reach  the  same  people 
regardless  of  their  differences  in  distribution  in  the 
vehicles,  or  a  combination  of  these  occurs. 

In  general,  what  are  the  possible  sources  of  varia¬ 
tion  xn  relative  exposure  of  population  subgroups  across 
themes?  First,  if  the  vehicles  or  sets  of  vehicles  in 
the  media  system  reach  different  relative  proportions  of 
the  population  subgroups,  then  a  concentration  of  mes¬ 
sages  in  one  vehicle  or  set  of  vehicles  will  produce  a 
different  distribution  of  vehicle  exposures  from  that 
of  a  concentration  of  messages  in  a  different  vehicle  or 
set  of  vehicles .  One  theme  might .  appear  mostly  in  morn¬ 
ing  weekend  radio  broadcasts,  while  another  theme  might 
appear  primarily  in  weekday  evening  newspapers.  Table 
VIII-23  shows  the  percentage  distributions  of  the  three 
weekday  newspapers  by  sex,  age,  and  education.  We  find 
that  even  though  the  total  audiences  differ  in  size, 
their  distribution  across  the  population  types  are  not 
greatly  different;  the  fact  that  a  theme  occurs  predom¬ 
inantly  in  one  or  the  other  of  the  newspapers  would  not 
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change  its  relative  distribution  across  population  sub¬ 
groups.  This  is  true  also  for  the  Sunday  newspaper, 
since  its  distribution  was  generated  from  that  of  the 
daily  Enquirer.  Recall  that  in  our  description  of  the 
radio  vehicle  audiences,  we  used  only  sex  as  a  defining 
dimension.  Thus,  the  proportional  distribution  of  these 
audiences  over  the  other  dimensions  as  produced  by  the 
Mosteller  parameter  estimation  routine  will  be  identi-. 
cui.  Therefore,  the  greatest  differences  which  might 
appear  between  the  vehicle  audiences  of  the  two  themes 
would  occur  if  one  theme  had  most  of  its  messages  in  one 
of  the  daily  newspapers,  while  another  theme  had  most  of 
its  messages  in  the  radio  vehicles.  In  our  scenario  it 
would  be  very  unlikely  for  one  of  the  themes  to  have 
most  of  its  messages  distributed  throughout  the  radio 
vehicles  since  all  of  the  messages  were  originally  de¬ 
rived  from  a  content  analysis  of  the  press.  It  seems 
likely,  therefore,  that  the  messages  of  each  of  the 
themes  were  distributed  in  both  the  press  and  the  radio 
and  that  the  resulting  distributions  of  the  audiences 
of  the  vehicles  carrying  the  messages  was  nearly  equiv¬ 
alent  from  theme  to  theme  except  in  the  total  size. 

A  second  possible  cause  of  different  distributions 
of  exposures  from  theme  to  theme  is  possible  varying 
levels  of  attention  to  different  themes  by  the  popula¬ 
tion  subgroups.  These  levels  of  attention  are  calculated 
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in  the  conditional  message  exposure  probabilities  of 
the  second  stage  of  the  simulation.  There  are  two  ways 
in  which  these  message  exposure  probabilities  can  create 
quite  different  message  audience  distributions  across 
the  cells  even  though  the  vehicle  audiences  are  quite 
similar. 

First,  the  estimation  of  the  exposure  probabilities 
for  men  and  women  from  the  regression  equations  might 
produce  for  the  messages  of  one  theme  a  quite  different 
distribution  of  men  and  women  than  for  a.  second  theme. 
This  could  happen  if  the  messages  of  the  one  theme  were 
concentrated  in  the  sport  pages.  Insofar  as  this  is 
not  true,  however,  the  ratio  of  the  average  probability 
of  exposure  for  men  to  that  for  women  would  probably  not 
change  significantly  from  theme  to  theme.  Note  also  that 
this  regression  distinguishes  only  between  men  and 
women  and  predicts  no  probabilities  based  on  other  of 
the  population-defining  dimensions. 

Second,  the  ratios  of  the  conditional  exposure  prob¬ 
abilities  which  distribute  different  probabilities  over 
some  of  the  population  dimesnions  could  be  different 
from  theme  to  theme  and  would  therefore  produce  signifi¬ 
cant  differences  in  the  message  audiences  from  theme  to 
theme.  We  might. find,  for  example,  that  for  a  given 
theme  the  probability  of  exposure  of  well-educated  peo¬ 
ple  was  much  lower  in  comparison  with  poorly-educated 
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people  than  for  a  second  theme.  Although  the  simulation 
can  accept  different  ratios  for  each  ther.,’:,  ve  have  not 
been  able  to  distinguish,  within  the  general  area  of  ex¬ 
posure  to  international  news,  thematic  differences  in 
the  ratios  of  conditional  message  exposure  probabilities, 
e.g.,  for  college-  and  high  school-educated  people  or 
high  scttool-  and  grad's  school-educated  people,  etc., 
because  these  data  dimply  do  not  exist  except  for  the 
general  category  c-f  international  news.  Thus,  this  pos¬ 
sible  source  of  variation  in  exposure  from  theme  to 
theme  was  not  present  in  our  simulation  runs. 

This  is  not  to  say  that  the  distributions  of  expo¬ 
sures  across  the  population  types  in  the  present  simu¬ 
lation  are  incorrect,  it  may  very  well  be  true  for  this 
limited  variety  of  news,  carried  orly  by  newspapers  and 
radio  vehicles,  tnat  the  major  differences  in  exposure 
from  theme  to  -heme  are  not  differences  in  distribution 
across  population  types  but  rather  differences  in  levels 
of  exposure  due  to  the  different  numbers  of  messages  and 
their  relative  importance  as  indicated  by  their  format 
factors.  It  seems  clear,  however,  that  if  we  were  going 
to  specify  more  precisely  the  distribution  of  exposure 
to  the  various  themes  across  the  population  types,  we 
might  well  invest  research  dollars  investigating  addi¬ 
tional  dimensions  of  the  vehicle  audiences  of  the  radio 
news  broadcasts,  the  different  formats  of  messages  and 
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their  effect  on  message  exposure  across  population 
types,  and,  in  addition,  a  content  analysis  of  the  radio 
news  broadcasts  to  determine  directly  the  number  and 
kind  of  messages  carried  via  these  media  vehicles. 

The  finding  that  those  people  who  are  most  exposed 
to  one  theme  are  likely  to  be  most  exposed  to  other 
somewhat  similar  themes  in  the  mass  media  seems  quite 
consonant  with  the  common  finding  of  audience  studies, 
that  those  people  who  participate  highly  in  one  kind  of 
communications  behavior  participate  highly  in  many 
other  kinds  of  communication  behavior. 

The  Pattern  of  Correlations 

Although  the  general  pattern  of  subgroup  exposure 
across  themes  was  constant,  perhaps  we  may  find  some 
tendency  for  the  correlations  to  be  higher  between  panel 
measures  which  seem  to  be  more  closely  related  to  cer¬ 
tain  of  the  themes,  implying  some  slight  differentiation 
in  the  audiences  reached  by  the  themes.  Thus,  we  might 
expect  that  the  correlations  of  any  of  the  first  four 
themes  with  such  things  as  problems  or  expectations  of 
war  or  problems  with  the  U.f/.S.R.  might  be  slightly 
higher  than  the  correlations  of  the  other  less  obviously 
related  themes  with  these  panel  variables.  In  general, 
this  seems  not  to  be  the  case,  although  it  is  true  that 
the  correlations  of  the  first  four  themes  with  the 
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increasing  expectations  of  war  are  somewhat  higher  in 
general  than  that  of  the  other  themes.  However,  this 
does  not  hold  true  for  problems  with  the  U.S.S.R.,  with 
problems  of  war,  or  with  the  correlations  between  the 
knowledge  of  the  veto  power  in  the  United  Nations  and 
the  themes  relating  to  the  veto.  Thus,  we  have  no  evi¬ 
dence  in  this  table  that  exposure  to  a  particular  theme 
increases  conversion  or  likelihood  of  change  of  opinion 
on  a  particular  NORC  panel  measure. 

From  the  discussion  above  we  conclude  that  exposure 
to  any  of  the  themes  is  equivalent  to  exposure  to  inter¬ 
national  news  messages  in  the  mass  media  system  in 
general,  and  that  this  general  exposure  has  different 
correlations  with  the  different  measures  in  the  NORC 
panel.  Which  panel  variables  seem  to  correlate  most 
highly  with  this  general  exposure?  The  data  in  Table 
VIII-22  indicate  that  exposure  to  the  themes  correlates 
strongly  and  negatively  with  a  high  information  change 
from  September  to  March,  very  strongly  and  positively 
with  a  medium  information  change  from  September  to 
March,  strongly  and  positively  with  recall  of  exposure 
to  information  about  the  United  Nations  in  meetings, 
and  also  moderately  highly  with  recall  of  exposure  to 
the  United  Nations  through  newspapers  over  the  six 
months.  In  addition,  exposure  seems  to  correlate  highly 
with  increased  knowledge  of  the  veto  and  it  correlates 
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moderately  highly  with  the  increasing  dissatisfaction 
with  the  progress  of  the  United  Nations. 

Several  of  the  most  important  changes  that  occurred 
in  the  attitudes  of  the  Cincinnati  population  over  the 
six  months  were  the  large  increases  in  the  proportion 
who  named  as  problems  facing  the  country,  another  war 
or  maintaining  the  peace  (24  percent  to  46  percent)  , 
our  relations  with  Russia  (16  percent  to  29  percent;) , 
and  also  the  very  large  increase  in  the  proportion  who 
expected  the  United  States  to  enter  another  weir  within 
the  next  ten  years  (48  percent  to  73  percent) .  The 
correlation  between  the  first  of  these  changes  and  ex¬ 
posure  to  the  mass  media  is  very  weak  negative  correla¬ 
tion  for  every  theme,  indicating  no  overall  relation¬ 
ship  between  increasing  concern  about  war  and  any  of 
the  scenario  themes.  Thus,  if  exposure  to  these  themes 
in  the  mass  media  was  causing  some  people  to  be  more 
aware  of  the  likelihood  of  war,  then  there  were  other 
influences,  e.g.,  word  of  mouth  messages,  which  were 
causing  those  people  in  population  types  not  so  highly 
exposed  to  share  the  increase  in  concern  about  the 
problem  of  war. 

For  the  other  two  changes,  the*~>  are  small  positive 
correlations  with  exposure,  indicating  that  there  was 
some  tendency  for  those  population  subgroups  who  had  a 
higher  average  exposure  to  have  a  higher  conversion 
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index  or  proportional  change  in  the  direction  of  expect¬ 
ing  war  cr  naming  the  U.S.S.R.  as  a  problem  to  be  con¬ 
tended  with.  Unfortunately ,  the  thirteenth  theme  of 
the  simulation  containing  messages  relating  to  control 
of  the  atomic  bomb,  was  lost  due  to  a  programming  error 
and  therefore  we  do  not  have  a  correlation  between  this 
theme  and  the  panel's  increasing  interest  in  the  atomic 
bomb.  However,  we  do  find  a  moderatly  high  correlation 
between  this  variable  and  exposure  to  the  other  scenario 
themes. 

One  of  the  most  direct  relationships  likely  in  this 
data  should  be  that  of  simulated  exposure  and  recall  of 
exposure  to  the  United  Nations  via  certain  media  vehicles. 
The  pattern  of  high  positive  correlations  for  recall  of 
exposure  through  meetings,  newspapers,  or  through  a  com¬ 
bination  of  from  three  to  eight  media,  and  the  negative 
correlations  for  recall  of  exposure  via  the  radio  or  by 
leaflets  may  be  explained  by  the  plausible  assumption 
that  better  educated,  higher  status  persons  are  more 
likely  to  be  exposed  and  recall  exposure  via  newspapers, 
magazines,  books,  and  meetings,  and  are  relatively  less 
likely  to  be  exposed  or  to  recall  exposures  via  radio 
news  broadcasts  or  through  leaflets.  In  addition,  (pre¬ 
sumably)  these  people  became  increasingly  satisfied 
during  the  six  month  period  with  the  progress  of  the 
United  Nations  (the  correlations  with  increasing 
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dissatisfaction  are  negative).  However,  paradoxically, 
they  are  increasingly  likely  to  respond  that  the  United 
Nations  probably  will  not  succeed. 

How  can  we  explain  the  strong  positive  correlation 
with  medium  information  change  and  strong  negative  cor¬ 
relation  with  high  information  change?  Recall  that  the 
index  of  information  change  from  September  to  March  mea¬ 
sures  improvement  in  answering  correctly  the  six  ques¬ 
tions  asking  about  knowledge  of  the  various  aspects  of 
the  United  Nations.  High  information  change  indicates 
a  net  increase  in  the  index  of  from  three  to  five 
points;  medium  information  change  indicates  an  increase 
from  one  to  two  points.  If  we  assume  that  the  news¬ 
paper  readers  and  meeting  attenders  are  also  people  who 
are  initially  relatively  well-informed,  then  they  can 
become  only  slightly  better  informed  over  the  six 

j 

months,  resulting  in  a  medium  information  change.  Those 
people  who  had  a  high  information  change  are  people  who 
knew  very  little  at  the  beginning  of  the  six  month 
period  (who  somehow  either  learned  very  much  during  the 
period  or  are  guessing  and  made  lucky  guesses)  and 
these  are  people  who  tend  to  be  little  exposed  to  inter¬ 
national  news  messages  in  the  media  system. 

One  of  the  interesting  effects  in  these  correla¬ 
tions  occurs  for  the  question  asked  of  the  panel  con¬ 
cerning  their  interest  in  the  United  Nations  at  the  end 
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of  the  six  months  compared  with  their  previous  interest. 
Although  we  find  an  insignificant  negative  correla¬ 
tion  between  exposure  and  ‘he  proportion  claiming  more 
interest  in  the  United  Nations,  the  correlations  between 
exposure  and  the  proportion  explaining  their  increased 
interest  either  in  terms  of  publicity  given  to  the  U.N. 
or  in  terms  of  world  crises  are  both  small  and  positive. 
Evidently,  those  people  who  can  rationalize  their  in¬ 
creased  interest  are  more  likely  to  be  exposed. 

Finally,  we  have  a  small  positive  correlation  be¬ 
tween  those  people  who  are  able  to  identify  the  slogan 
of  the  U.N.  information  campaign  at  the  end  of  six 
months  and  those  exposed  to  the  mass  media. 

We  may  draw  several  conclusions  from  our  data 
about  exposure  to  the  various  scenario  themes  and  its 
relationship  to  the  NORC  panel  data.  For  themes  re¬ 
lating  to  general  international  affairs  news,  the  data 
by  which  we  specify  the  simulation  model  do  not  allow 
us  to  differentiate  very  well  the  patterns  of  distri¬ 
bution  of  exposures  across  population  types  from  theme 
to  theme,  except  in  the  overall  average  level  of  expo¬ 
sure  to  each  theme,  which  is  primarily  a  function  of 
the  number  of  messages  relating  to  the  theme.  In  the 
simulation  it  appears  that  the  variations  in  the  expo¬ 
sures  of  the  audience  across  the  population  types  is 
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governed  more  by  the  media  habits  of  the  audience  than 
by  theme-specific  differences  in  attention. 

The  correlations  in  the  table  may  be  artificially 
weak  due  to  random  errors  in  the  data.  In  addition , 
they  may  be  weak  because  we  have  not  attempted  to  simu¬ 
late  several  other  souuces  of  exposure/  especially  the 
signs,  leaflets,  meetings,  magazines,  and  films  which 
were  used  to  some  degree  in  the  information  campaign.- 
nor  have  we  attempted  to  simulate  word-of-mouth  or  in¬ 
formal  communications  relating  to  these  themes.  The 
latter  could  very  well  be  a  third  stage  of  the  simula¬ 
tion  wherein  people  receiving  certain  kinds  of  messages 
from  the  mass  media,  communicate  with  other  people  in 
the  simulation  population  about  these  messages.  Pre¬ 
sumably,  interpersonal  communication  often  has  a 
stronger  effect  upon  those  involved  in  the  communication 
than  messages  via  the  mass  media.  An  interesting  ques¬ 
tion  would  be  what  proportion  of  the  variance  in  con¬ 
version  or  change  in  the  level  of  knowledge  over  six 
months  can  be  explained  simply  in  terms  of  the  messages 
in  the  mass  media  as  we  have  simulated  them  and  what 
proportion  is  accounted  for  by  word-of-mouth  communi¬ 
cations  and  less  formal  forms  of  communications .  We 
note  that  the  strongest  correlation  found  in  the  present 
simulation,  for  recall  of  exposure  to  the  U.N.  in 
meetings  and  for  medium  information  change,  has  a  value 


of  .74.  In  this  case,  the  exposures  to  these  themes  in 
the  mass  media  as  we  have  modeled  them  explain  at  most 
about  fifty  percent  of  the  variance  in  these  particular 
dependent  variables.  With  a  simulation  model  which  in¬ 
cluded  various  sources  of  communication,  and  recorded 
exposures  by  source  of  communication ,  we  might  be  able 
to  increase  the  explanatory  power  and  separate  the  ef¬ 
fects  of  the  various  sources  of  communication.  This  is 
a  possibility  for  future  research. 

Finally,  we  would  like  to  point  out  that  the  cor¬ 
relations  are  across  groups  and  are  therefore  ecologi¬ 
cal  correlations.  We  have  not  correlated  individual 
change  with  individual  exposure  because  the  simulation 
does  not  presently  report  individual  exposures  (al¬ 
though  it  does  keep  internally  a  record  of  individual 
exposures)  and  also  because  the  survey  data  is  probably 
not  good  enough  to  provide  reliable  data  on  individual 
changes.  However,  there  is  one  additional  reason  for 
not  running  correlations  by  individuals.  Within  a 
population  type,  all  individuals  are  equivalent  except 
in  their  probabilities  of  exposure,  i.e.,  we  have  no 
way  initially  of  associating  one  individual  with  an  es¬ 
pecially  high  likelihood  of  conversion  and  another  with 
a  low  likelihood  of  conversion  until  after  the  proba¬ 
bilities  of  being  in  the  audiences  of  the  messages  have 
been  computed.  Thus,  there  is  no  way  of  making  a 
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one-to-one  correspondence,  prior  to  running  the  simu¬ 
lation,  between  an  individual  in  the  simulation  popu¬ 
lation  and  a  member  of  the  NORC  panel. 

What  can  we  finally  conclude  about  tha  impli¬ 
cations  of  the  simulated  exposures  and  related  changes 
in  the  NORC  panel  for  the  validity  of  the  simulation? 
First,  although  we  cinnot  easily  calculate  an  appropri¬ 
ate  correlation  between  the  overall  changes  in  the  panel 
and  overall  exposure  to  the  simulation  themes  (because 
there  is  no  obvious  one-to-one  matching  of  panel  vari¬ 
ables  and  simulated  themes) ,  we  do  find  a  high  degree 
of  correspondence  between  those  areas  of  greatest 
change  in  the  panel  and  the  themes  showing  the  highest 
average  exposure.  The  largest  changes  in  opinions  and 
attitudes  involved  the  issues  of  war  and  peace  and  re¬ 
lations  with  the  U.S.S.R.:  the  themes  concerned 
with  these  issues  (the  first  four  themes  in  the 
scenario)  were  by  far  the  most  important  themes,  both 
in  terms  of  the  numbers  of  actual  messages  and  also 
in  terms  of  the  average  number  of  exposures  in  the  pop¬ 
ulation.  The  information  and  opinion  areas  concerning 
the  functions  of  the  U.N.  and  levels  of  support  for  it, 
which  showed  very  slight  changes  in  the  panel,  also  had 
only  few  messages  in  the  scenario  and  very  low  levels 
of  exposure  in  the  simulated  population  over  the  six 
months.  Thus,  the  simulation  results  for  those  broad 


areas  of  opinion,  attitude,  and  information  seem  to  support 
the  validity  of  the  model  in  predicting  average  levels  of 
exposure  for  the  population. 

The  second,  more  difficult,  attempt  at  validation 
by  using  correlations  across  population  subgroups,  seems  not 
to  be  so  successful.  These  data  do  not  add  significantly  to 
our  attempts  at  validation  of  the  exposures  produced  by  the 
mass  media  simulation.  Some  of  the  correlations  we  found 
here  are  understandable  and  others  require  some  strain  in 
interpretation.  Our  analysis  suggests  some  of  the  kind  of 
things  we  can  do  with  this  simulation,  but  it  also  illus¬ 
trates  the  magnitude  of  the  task  of  attempting  a  simulation 
of  this  sort  and  the  additional  kinds  of  information  and 
data  which  must  be  developed  in  order  to  make  a  more  specific 
differentiation  of  the  audiences  of  such  closely  related 
themes. 

The  Mass  Media  Simulation:  Summary  and  Conclusions 

This  study  is  really  a  series  of  studies  combined 
into  one.  Simulation  is  apt  to  force  one  into  this  kind 
of  operation  because  it  requires  explicit  data  in  many 

different  areas  relevant  to  the  problem  at  hand. 

The  first  stage  of  the  study  might  be  called  the 

elaborate  and  complete  specification  of  the  model. 

This  means  that  the  model  is  not  just  a  verbal  model 
of  the  process  of  exposure  in  a  mass  media  system,  con¬ 
tained  in  some  number  of  pages  of  prose,  but  a  mathe¬ 
matical  model  developing  the  process  from  one  stage  to 
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another  in  such  detail  that  it  can  be  programmed  for  a 
computer.  In  the  development  of  this  model,  of  course, 
there  are  possibilities  for  calculation  and  elaboration 
due  to  the  fact  that  the  model  will  be  implemented  on 
a  computer.  However,  the  computer  also  imposes  con¬ 
straints  in  size  and  time  of  calculation  upon  the  model. 

The  next  stage  in  the  development  is  the  actual 
programming  of  the  mathematical  model  in  order  to  im¬ 
plement  it  on  the  computer.  In  reality,  it  is  not 
true  that  the  first  and  second  stages  are  independent 
of  one  another;  it  always  turns  out  that  the  most  com¬ 
plete  initial  mathematical  specification  of  a  very  gen¬ 
eral  model  such  as  this  omits  quite  in£>ortant  transi¬ 
tions  between  various  stages  or  fails  to  recognize  as¬ 
sumptions  which  must  be  made  and  which  only  come  to 
light  in  programming  the  model  for  the  quite  literal¬ 
minded  computer.  The  present  simulation  was  progra.  "a 
and  reprogrammed  over  a  period  of  several  years  by 
many  different  students  and  finally  made  operational 
by  Selesnick  and  Kramer  after  a  year  of  extensive  and 
intensive  work.  The  result  comprises  seven  different 
machine  loads  or  links  of  programs.  After  programming 
the  model,  of  course,  there  must  be  debugging  and  test 
runs  and  it  will  happen  that  contingencies  that  were 
not  thought  of  in  the  previous  stages  will  arise  and 
must  be  dealt  with.  Even  afti  his  point,  however. 


each  and  the  measurement  of  duplication  between  vehicles 
might  well  be  in  itself  a  long  and  elaborate  study. 

Obviously,  the  content  analysis  necessary  to  de¬ 
scribe  the  scenario  of  messages  which  appear  to  the 
audience  could  provide  at  least  one  thesis  if  carried 
out  in  a  most  rigorous  fashion.  This  content  analysis 
must  not  only  analyze  certain  vehicles  for  occurrences 
of  certain  themes,  but  must  also  specify  such  matters 
as  the  format  of  each  of  the  messages  carrying  the 
theme.  When  one  is  simulating  the  process  of  exposure 
on  a  computer,  it  is  not  sufficient  to  assume  that 
there  is  a  one-to-one  correspondence  between  appearance 
of  the  theme  and  exposure  in  a  population;  therefore, 
each  message  is  not  equivalent  to  every  other  message 
and  matters  of  format,  attractiveness,  and  locati  .1  of 
the  message  become  quite  important. 

Just  how  important  these  factors  are  for  reader- 
ship  or  listenershit?  is  the  subject  of  smother  kind  of 
study,  namely,  the  effects  of  format,  location,  appear¬ 
ance,  etc.,  upon  the  probabilities  of  exposure  via 
various  channels  for  various  audience  types.  We  have 
explained  in  some  detail  in  Chapter  VII  how  such  a 
study  might  become  quite  an  elaborate  affair. 

Finally,  even  after  all  these  studies  have  been 
done,  the  organization  of  all  this  data  into  the  frame¬ 
work  of  the  simulation  and  the  processing  of  survey 


type  data  from  the  content  analysis  or  the  message 
exposure  probability  analysis  into  the  kinds  of  files 
and  data  useful  for  the  simulation  Jis  itself  a  tedious, 
time-consuming  undertaking. 

We  have  had  problems  in  evaluating  the  simulation, 
because  for  the  best  test— comparing  actual  and  simu¬ 
lated  exposures  to  messages  (rather  than  to  vehicles) — 
we  must  depend  upon  indirect  asures  of  actual  expo¬ 
sure.  Even  these  measures  cure  possibly  confounded  by 
the  variations  in  prior  exposure  and  exposures  via  in¬ 
formal  communication  channels.  We  suggest  that  a  better 
test  might  involve  simulating  exposure  to  information 
about  a  new  product  first  being  introduced  into  a  com¬ 
munity.  /T  vertheless,  the  model  does  succeed  in  syn¬ 
thesizing  consistently  the  input  data  and  it  does  pro¬ 
duce  plausible  distributions  of  exposures.  Acknow¬ 
ledging  that  we  are  simulating  only  the  portion  of  ex¬ 
posure  due  to  mass  media  messages  flows,  we  do  feel 
that  the  model  is  valid  for  this  purpose  if  sufficient 
attention  is  paid  to  the  input  data. 

We  would  advise  the  prospective  researcher  to  give 
some  attention  to  the  relative  need  for  precision  and 
comprehensiveness  in  his  input  data  before  he  begins 
to  gather  it.  In  simulating  with  several  sources  of 
data,  there  is  always  a  temptation  to  be  as  precise  as 
possible  about  each  data  set;  however,  this  can 


Table  vin-23.  The  Relative  Distributions  of  the  Average 

Audiences  for  the  Three  Weekday  Newspapers, 
by  Sex,  Age,  and  Education 


Male 

Female 

Age 

Age 

Education 

21-39 

40- 

21-39 

40- 

Totals 

Enquirer 

College 

High  School 
Grade  School 

8.9%a 

12.3 

3.0 

11.2% 

7.6 

9.7 

4.6% 

14.2 

1.9 

3.4% 

11.4 

11.8 

28.1% 

45.5 

26.4 

Total 

24f2% 

28,5% 

20.7% 

26.6% 

100.0% 

~  52 

^7%~ 

47 

.3% 

Totals : 

21-39 

=  44% 

40-  - 

55.1% 

Post 

College 

High  School 
Grade  School 

7.1% 

13.5 

6.6 

4.5% 

8.1 

8.8 

3.1% 

17.5 

3.8 

1.2% 

11.4 

14.2 

15.9% 

50.5 

33.4 

Total 

27.2% 

\ 

48 

21.4% 

24.4% 

26.8% 

99.8% 

51 

.2% 

Totals : 

21-39 

-  51.6% 

40-  = 

48.2% 

Times-Star 

College 

High  School 
Grade  School 

5.5% 

10.8 

4.1 

8.5% 

7.8 

11.9 

4.1% 

14.0 

3.9 

3.6% 

11.9 

13.9 

21.7% 

44.5 

33.8 

Total 

20.4% 

[ _ 

28.2% 

22.0% 

29.4% 

100.0% 

48 

.6% 

51 

.4% 

Totals : 

21-39 

=  42.4% 

40-  = 

57.6% 

» 


^he  percentages  are  the  percentages  of  each  subgroup  in  the  average 
audience  of  the  newspaper. 


obviously  lead  to  misplaced  precision  and  a  misalloca- 
tion  of  effort  and  money.  In  deciding  where  to  invest 
his  effort f  we  do  not  unqualifiedly  advocate  that  the 
researcher  try  to  match  the  level  of  precision  of  the 
different  data  sets#  but  rather  that  he  try  to  specify 
the  importance  of  precision  and/or  comprehensiveness 
of  each  data  set  for  his  research  goals.  Let  us  illus¬ 
trate  from  the  present  simulation. 

In  Chapters  III  through  VI#  we  describe  a  very 
large  investment  of  effort  in  producing  the  probabil¬ 
ities  of  vehicle  exposure  for  the  Cincinnati  mass  media 
system.  However,  we  saw  that  the  content  analysis  was 
not  reliable  (we  estimate  that  twenty-five  percent  of 
the  relevant  messages  were  missed  by  the  two  coders) 
and#  in  addition#  because  of  storage  problems  we  were 
forced  to  discard  twenty  percent  of  the  messages  in 
one  of  the  most  important  themes.  We  believed,  there¬ 
fore#  at  first,  that  much  of  the  effort  in  specifying 
the  media  system  was  misplaced.  Now#  however,  we  will 
argue  that  even  more  effort  should  have  been  invested 
into  specifying  the  vehicle  audiences ,  especially  for 
radio#  and  perhaps  also  in  relating  message  exposure 
probabilities  to  the  content  of  themes.  This  is  be¬ 
cause  we  feel  that  the  most  interesting  predictions  of 
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the  model  relate  to  the  distribution^  of  exposure 
rather  than  to  the  overall  level  of  exposure.  Ihe 
former  is  a  function  of  the  vehicle  audience  distribu¬ 
tions,  the  vehicle  distribution  of  the  messages  (a 
relative  statistic  which  could  be  measured  by  a  small 
sample  of  messages) ,  and  the  ratios  of  message  expo¬ 
sure  probabilities.  The  level  of  exposure  is  a  func¬ 
tion  of  the  total  level  of  vehicle  audiences,  the  num¬ 
ber  of  messages,  and  the  average  proportions  of  the 
vehicle  audiences  exposed  to  a  theme  (PORTN) . 

Thus,  if  we  are  interested  in  the  changes  in  the 
audience  distribution  from  theme  to  theme,  the  radio 
audiences  which  were  differentiated  only  by  size  and 
sex  breakdown  are  prime  candidates  for  further  specifi¬ 
cation.  Moreover,  since  the  formating  of  messages 
seems  to  account  for  only  fifty  percent  of  the  variance 
in  message  exposure  probabilities  for  international 
news  themes,  the  relationships  of  these  probabilities 
for  the  various  population  types  to  the  content  of 
these  themes  would  seem  a  worthy  area  for  research . 

As  for  the  content  analysis,  the  fact  that  the  correl¬ 
ations  do  not  change  from  time  period  to  time  period 

^The  knowledge  that  females  of  ~i*ade  school  edu¬ 
cation  and  low  SES  are  (according  to  tus.  simulation) 
ten  to  fifteen  times  less  exposed  than  males  of  high 
school  education  and  high  SES  goes  very  far  in  explain¬ 
ing  the  failure  of  the  Cincinnati  U.N.  information 
campaign. 
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and  that  the  same  groups  are  highly  or  little  exposed  for 
themes  of  from  30  to  1700  messages,  indicates  that  we 
could  have  run  the  simulation  through  just  the  first  time 
period  instead  of  thirteen  (saving  twelve-thirteenths  of 
the  seven  hours  of  computer  time) ,  or,  equivalently,  we 
could  have  taken  only  a  small  sample  of  messages  in  the 
original  content  analysis,  and  still  have  produced  the 
same  pattern  (not  level)  of  exposures. 

For  this  particular  simulation  model,  we  see  four 
changes  or  additions  which  would  greatly  increase  its  use¬ 
fulness.  First,  it  seems  ofcvious  that  frequency  distribu¬ 
tions  as  well  as  averages  of  expected  exposures  should  be 
available  for  the  population  subgroups.  Also  the  variances 
of  the  exposures  should  be  reported.  Second,  the  statis¬ 
tics  should  be  maintained  in  the  computer  for  such  uses  as 

plotting  graphs  over  time  and  coupling  with  models  of  ef- 

17 

feet  and/or  word-of-mouth  communications.  Third,  we 
would  prefer  that  the  audience  duplication  across  themes 
be  defined  in  terms  of  at  least  X  exposures  to  theme  A  and 
Y  exposures  to  theme  B  (where  the  researcher  is  free  to 
choose  X  and  Y)  rather  than  in  terms  of  at  least  one  expo¬ 
sure,  as  is  presently  done.  Finally,  we  are  not  satisfied 
with  the  handling  of  message  exposure  probabilities  in 

17 

Except  for  reporting  of  the  variances,  these  first 
two  recommendations  have  already  been  incorporated  into 
the  simulation  in  the  version  reprogrammed  for  the  IBM 
360-65. 
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terms  of  means  for  the  population  types.  At  present,  two 
messages  appearing  in  a  vehicle  sure  treated  as  two  separ¬ 
ate  appearances  of  the  vehicle,  each  carrying  the  message. 
Thus,  a  person !s  exposure  to  the  second  message  is  inde¬ 
pendent  of  his  exposure  to  the  first.  Obviously,  this  is 
a  distortion  of  reality,  just  as  the  assumption  of  inde¬ 
pendence  in  vehicle  exposures  is  a  distortion,  but  in 
this  instance  it  appears  more  serious.  We  might  model 
this  better  by  considering  cumulation  in  message  expo¬ 
sures  to  differentiate  message  exposure  probabilities 
within  the  population  types. 

We  have  one  final  admonishment  for  builders  of  simu¬ 
lations.  Since  we  always  have  measurement  error  and  exo¬ 
genous  variables,  we  should  always  attempt  to  describe 
outcomes  with  probability  distributions,  or  at  least  ex¬ 
pected  values  and  variances.  Also,  we  should  beware  of 
costly  Monte  Carlo  solutions  when  expected  values  and 
variances  will  save  time  (money)  and  provide  a  more  com¬ 
plete  description  of  the  likely  outcomes. 


APPENDIX  A 

RADIO  AUDIENCE  MEASUREMENT 

One  of  The  sources  of  radio  audience  data  in  this 
simulation  is  the  Hooper  ratings.  These  ratings  are  pro¬ 
duced  by  the  following  process:  within  a  given  city,  calls 
are  placed  to  a  random  sample  of  the  telephone  homes  located 
within  the  non- toll  call  area  of  the  city.  If  there  is  an 
answer  within  six  rings  the  respondent  is  asked  the  following 
questions:  "Were  you  listening  to  your  radio  just  now?  ... 

To  what  program  were  you  listening,  please?  ...  Over  what 

1 

station  is  that  program  coming?" 

These  calls  continue  through  the  day.  The  rating 
for  a  particular  program  is  then  the  percentage  of  the  homes 
called  which  are  listening  to  the  program  while  it  is  being 
broadcast . 

Nielsen  has  pointed  out  several  difficulties  with 
2 

this  measurement: 

1.  The  sample  can  only  be  considered  representative 
of  the  telephone  homes  in  a  city. 

1 

Hooper,  "Introduction,"  City  Hooperatings ,  p.  2. 
Additional  details  of  the  Hooper  coincidental  method  and 
trends  in  the  ratings  over  several  years  may  be  found  in 
Mathew  N.  Chappell  and  C.E.  Hooper,  Radio  Audience  Measure¬ 
ment  (New  York:  American  Book-Stratford  Press,  Inc.,  1944). 
o 

Arthur  C.  Nielsen,  New  Facts  About  Radio  Research 
(New  York:  A.C.  Nielsen  Co.,  1946),  pp-  26-27. 
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2.  The  meaning  of  the  word  listening  must  be  decided 
by  each  individual  as  he  answers  the  question. 
Therefore  the  quality  of  listening  is  variable. 
(Hooper  shows,  however,  that  ”...  a  program's 
effectiveness  in  influencing  the  behavior  of 
listeners  is  quite  independent  of  any  conscious 
impression. "w ) 

3.  The  n’unber  of  members  of  the  family  who  are 
listening  is  not  measured. 

4.  That  member  of  the  family  most  likely  to  answer 
the  phone  is  the  one  least  likely  to  be  listening. 

5.  Busy  signals. 

6.  Refusals  to  talk. 

7.  Deliberate  misrepresentations  and  errors. 

8.  Seme  homes  have  multiple  radios  and  different 
members  of  the  family  may  be  listening  to  differ¬ 
ent  programs,  thereby  confusing  the  meaning  of  the 
measurement. 

We  will  concern  ourselves  here  with  the  most  impor¬ 
tant  problem:  how  representative  of  the  homes  of  the  Cincin¬ 
nati  Metropolitan  District  are  the  telephone  homes  located 
in  the  non-toll  call  area  of  the  city?  According  to  the 

3 

Chappell  and  Hooper,  Radio  Audience  Measurement, 


p.  103. 


telephone  company  in  Cincinnati,  approximately  eighty-three 
percent  (135,283)  of  the  households  in  the  city  of  Cincinnati 
had  residential  phone  service  in  1948.  An  additional  17,412 
optional  subscribers  in  Ohio  and  31,106  subscribers  in  northern 
Kentucky  also  had  toll-free  service  to  Cincinnati  phones  in 
1948.  Most  of  these  additional  phones  were  located  within 
the  Metropolitan  District.  The  total  of  170,380  toll-free 
subscribers  within  the  Metropolitan  District  amounted  to  62 
percent  of  the  households  in  the  area  (using  the  1950  census 
estimate  of  276,715  households  in  the  Standard  Metropolitan 
Area).  Looked  at  as  sixty- two  percent  of  the  households, 
the  population  represented  by  the  Hooper  sample  would  seem 
a  poor  basis  for  audience  estimates  for  the  Metropolitan 
District.  However,  we  have  several  good  reasons  for  using 
the  Hooper  ratings: 

1.  Most  compelling,  they  are  the  or">y  data  available. 

2.  Many  other  measurements,  especially  the  content 
of  radio  news  broadcasts,  are  quite  imprecise. 

3.  The  telephone  homes  did  probably  represent  about 
eighty-three  percent  of-  the  households  in  the  non¬ 
toll  call  area.  There  are  no  obvious  grounds  for 
believing  that  the  telephone  homes  outside  the  non¬ 
toll  call  area  (but  still  within  tne  Metropolitan 
District)  were  substantially  different  from  those 
within  the  non-toll  call  area.  Therefore,  we  may 

ii 

These  figures  and  a  map  of  the  non- toll  call  and 
optional  toll-free  areas  for  1948  were  provided  by  the  Cin¬ 
cinnati  and  Suburban  Bell  Telephone  Company. 


have  substantially  more  confidence  in  the  sample  than  the 
sixty-tvo  per  cent  figure  would  indicate. 

The  Hooper  ratings  are  coincidental  ratings;  they  measure  the  average 
audience  of  the  program,  not  total  audience,  since  the  audience  changes  from 
minute  to  minute.  A  respondent  who  listens  to  only  half  of  a  program  on  each 
of  two  occasions  would  likely  be  counted  in  the  audieoce  only  once,  not  for 
both  programs.  It  is  the  equivalent  full-time  audience  which  is  measured. 
This  is  an  underrepresentation  of  the  true  audience.  The  analogous  measure¬ 
ment  of  a  newspaper  audience  would  count  the  person  who  reads  ten  per  cent  of 
the  newspapers  as  only  one-tenth  of  a  person  in  totaling  the  audience!  Niel¬ 
sen,  on  the  basis  of  electronically  received  (Audimeter)  ratings,  estimates 
that  the  average  audience  measurement  cuts  the  ratings  of  stations  and  progrm 
by  twenty  to  forty  per  cent.  Of  course,  the  longer  the  programs,  the  more 
likelihood  of  changes  in  the  audience,  and  therefore  the  greater  the  differ¬ 
ence  between  the  total  audience  and  the  coincidental  audience.  The  chart 
below  (Figure  A— 1  )  demonstrates  quite  clearly  the  effect  of  program 

length. ^ 


Program  Duration 

Figure  A-1 .  Total  Audience 
vs.  Average  ("Coincidental”) 

Audience. 

^Nielsen,  New  Facts,  p.  46. 

^ The  chart  is  from  Nielsen,  New  Facts,  p.  52.  Although  the  dates  are  not 
provided,  the  data  probably  esa  :  from  1945  Audimeter  measurements . 
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Other  Nielsen  data,  represented  in  the  chart  below 

(Fig.  A-2),  seem  to  show  the  effect  of  type  of  program  upon 

7 

the  difference  between  the  total  and  average  audience.  The 
data,  given  in  terms  of  the  percentage  of  the  total  program 
time  listened  to  by  the  average  audience,  demonstrate  that 
news  broadcasts  rank  second  only  to  the  daytime  serials  in 
holding  the  audience.  However,  we  must  be  cautious  in  attri¬ 
buting  the  differences  in  holding  power  to  the  program  types, 
since  it  seems  likely  that  they  fall  into  an  order  of  length, 
i.e.,  the  daytime  serials  and  news  broadcasts  which  seem 
to  have  the  greatest  holding  power  tend  to  be  fifteen  minutes 
in  length,  while  at  the  other  extreme  the  concert  music 
broadcasts  which  have  the  poorest  holding  power  are  probably 
also  the  lengthiest  programs. 

Since  the  amount  of  international  news  per  broadcast 

g 

is  usually  about  three  continuous  minutes,  we  expect  from 
the  Nielsen  figures  that  the  total  audience  exposed  to  it 
is  very  nearly  equal  to  the  average  audience 


n 

This  Nielsen  data  is  cited  in  C.H.  Sandage,  Radio 
Advertising  for  Retailers  (Cambridge:  Harvard  University  Press, 
I  >45) ,  pp.  142-143.  No  date  is  given. 

8 

See  the  results  of  Xavier  University  Study  cited 
above,  p.  121. 
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Percent  of  total  prograa  tine 
listened  to  by  average 
audience 


Figure  A-2.  Holding  Power  Variations 
in  Different  Types  of  Programs. 
(Source:  Hlelaen  Radio  Index, 

A.C.  Hlelaen  Conpany,  Chicago) 


APPENDIX  B 


A  PROBABILISTIC  MODEL  OF  EXPOSURE 


Assign  a  number  to  each  member  of  the  population 
from  1  to  K,  e.g. ,  j  =  I(  2,  3,  .  .  .  ,  K. 

Define  the  event  A^s 

A?  =  the  jth  member  of  the  population  is  exposed 
at  least  once  during  the  ith  day. 

Define  the  random  variable  and  associate  with 
it  the  probability  P^  in  the  following  manner. 

a  _  f  1  if  the  event  A?  is  observed,  and  this  happens 
xJ  =  \  with  probability  P 

V  0  otherwise,  with  probability  1-P  j . 

Pj  is  a  number  between  zero  and  one  such  that  if  one  ob¬ 
serves  over  n  days,  then  in  the  limit  as  n  becomes  very 
large,  Pj  represents  the  frequency  of  exposure  of  the  jth 
person: 


In  this  first  simple  model  we  assume  that  the 
probabilities  Pj  are  constant  over  time  and  independent 
from  person  to  person  and  from  day  to  day.  Thus  the 
process  of  the  jth  person  being  exposed  on  the  ith  day 
is  a  Bernoulli  process. 


The  probability  P^  is  the  probability  of  the  event 
A?.  But  we  also  assume  that  it  is  characteristic  of  the 
jth  person  in  the  same  way  that  a  coin  has  a  characteristic 
probability  of  heads.  It  attaches  itself  to  the  jth  person 
in  that  it  represents  the  likelihood  per  day  of  his  being 
exposed  via  the  particular  vehicle  in  question.  When  we 
attempt  a  computer  simulation  of  the  process  of  exposure 
of  a  population  via  the  vehicle,  we  store  as  data  a  list 
of  these  probabilities,  one  probability  for  each  member 
of  the  population. 

Below  we  shall  define  a  random  variable  whose 

value  is  the  number  of  people  in  the  population  who  are 
exposed  on  exactly  r  of  n  days.  We  now  make  the  assumption 
that  the  unknowns  in  the  model  (the  values  of  K  and  the 
P^'s)  are  so  related  that  the  expected  values  of  the  random 
variable  for  r  <=  0, 1, 2, . . .  ,n  are  "very  close"  to  the 
observed  values  of  the  frequency  curve. *  This  assumption 
indicates  how  the  model  is  related  to  the  observed  data 
and  implies  some  relations  among  the  unknowns;  however  it 
falls  far  short  of  completely  specifying  the  unknowns  (even 
how  many  there  are)  and  this  task  requires  a  much  stronger 
assumption,  namely  the  form  of  the  distribution  of  the  Pj’s 
(to  be  discussed  below) . 

*  I  think  we  should  ideally  assume  that  the  unknowns 
are  so  related  as  to  make  the  most  likely  values  of 
0,1,2,  ...,n  "very  close"  to  the  observed  values  of.  the17' fre¬ 
quency  curve.  However,  as  we  shall  see  below  it  is  practically 
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Now  we  show  the  implications  of  the  model.  If  we 
define  a  random  variable  r j  representing  the  number  of 
times  out  of  n  that  the  jth  person  is  exposed: 

r-  -I  x0 

j  t*.  ■ 


then  rj  is  just  the  sum  of  n  Bernoulli  trials  and  its 
probability  is  the  binomial  probability.  (We  drop  the 
j  on  the  r  here  because  it  is  carried  on  the  P^’s.) 

The  probability  that  the  jth  person  will  be  exposed  on 
exactly  r  of  n  days,  given  P^,  is 


Now  let  us  define  the  new  random  variable: 


Y.  = 
J 


1  if  the  jth  person  is  exposed  on 
exactly  r  of  n  days,  with  proba¬ 
bility  P^ 

C  otherwise,  with  probability  1-Pj  « 


Then  the  number  of  people  in  the  population  who  are  exposed 
on  exactly  r  of  n  days  is  given  by  the  random  variable: 


'll"  -  1  Y-  . 

i«*  J 


impossible  to  calculate  the  joint  likelihood  function  for 
theTfr  end  extremely  easy  to  calculate  the  expectations. 
Probably  both  assumptions  imply  very  nearly  the  same  re¬ 
lationships  among  the  unknowns. 
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It  is  very  difficult  to  write  the  probability  distribution 
for  71  £.  7[  £  can  take  on  values  from  0  to  K.  For  each  of 

these  values  the  associated  probability  consists  of  a  sum 

Y 

of  (value)  terms,  where  each  term  is  the  product  of  K 
probabilities,  the  PJj's  and/or  1-P^  1  s.  However,  we  can 
easily  write  the  expected  value  of  Tf  ”  using  the  rule  that 
the  expected  value  of  a  sum  is  equal  to  the  sum  of  expected 
values,  of  the  terms.  Thus: 

ECJf£)  =  5  0^)  +  E(Y2)  +  ...  +  E(Yk). 

Now  in  general  (from  above) : 


Therefore: 


E(Y-)  - ♦(.-«*>••  • 

J  ** 


(B-i) 


This  is  the  formula  for  the  expected  number  of  people 
who  are  exposed  exactly  r  days  in  n. 

Now  if  we  had  (for  example  only)  people  grouped 
by  probability  intc  h  groups,  of  probability  P^  and  number 
of  persons  n^,  then  the  sum  would  become: 


A-r 


where  ZOh'K 

k*  l 


(B-2) 
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If  we  wish  to  consider  the  continuous  case,  we 
can  do  so  as  follows: 

In  place  of  n^  we  let  N(p)  be  a  population  density 

function  such  that  N(p)dp  «  the  number  of  people  with 

probability  between  p  and  p+dp  (this  corresponds  to 

P„  )  and  such  that 
k 


/Wp'iap  *K 


Then  the  sum  becomes  an  integral  and  the  formula  for  the 
expected  number  of  people  who  are  exposed  on  exactly  r 
of  n  days  is  given  by: 


E(<)”/M:)pVprrdi 


(b‘3> 

<*j)  *  \  T  I  ' 

We  can  easily  return  to  our  first  form  by  letting  the 
population  density  function  be  a  sum  of  Dirac  celta 
functions. 


then. 


wP )-i 


ifp-ppcip'ii-pn. 


We  can  also  derive  the  formula  for  the  variance 


of  Tt  ^  .  Since  the  Y 


j  are  independent,  the  variance  of 


=»-=*• 


the  sum  is  equal  to  the  sum  of  the  variances.  Thus 

Va<  (rrl)  *  Var  (Y| )  +  Var  (^)  +■  .  .  .  +■  V*r  (Yk) 

The  variance  of  Yj  is  just 

Var(Yj)  =  ^'('-P/)  *, 
therefore 

for  the  continuous  case,  the  variance  of 7f”  becomes 

(B.4) 


A  second  quite  valuable  statistic  is  the  cumu¬ 
lative  number  of  people  exposed  at  lease  once  in  t  time 
periods.  We  define  a  new  random  variable: 

f  1  if  the  jth  person  is  exposed  at  least  once 
1  in  t  days,  with  probability 

1  0  otherwise,  with  probability 

Then  the  total  number  of  people  in  the  popula¬ 
tion  who  are  exposed  at  least  once  in  t  days  (the  t- 
period  cumulative  audience)  is  given  by  the  random 


variable 


Again,  it  is  very  difficult  to  write  the  probability 


V 
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X’ ' ~ . 


468 


distribution  for  Ct:  however  we  can  easily  calculate  the 
expected  value  and  variance  of  the  statistic  just  as 
above  forTf 


ECC^-lEte)  a*J  MariQ-lViriz). 
i’1  !"‘ 


Therefore 


and 


E (O-i  D-O-Pj)4] 


j*1 


For  the  case  of  the  continuous  population  density  function: 


Var(0'/N(P^'(’'P)^(''P)tldp 


(B-5) 


(B-6) . 


For  the  continuous  case  Hyett  has  suggested  using 
a  beta  density  function  to  model  the  probability  distribution 


*  6.  P.  Hyett,  paper  read  to  the  Statistics  Seminar, 
London  School  of  Economics,  February,  1958. 
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and  is  reported  to  have  tested  it  successfully  on  data 
from  American  magazines.  Below  we  outline  the  mathematics 
of  the  single  beta  function  (which  we  can  then  extend 
to  the  three  beta  function  case) . 


The  Single  Beta  Function  Model 


0*  p£ I  as 


The  beta  function  of  p  is  defined  over  the  interval 
3 


,  ,  x  p™'-1  (hp r 

fp  (?/<*>')  s  ^(^>1 


where  m1  and  n'  are  two  parameters,  each  greater  than 
zero,  and  ^(m',n')  is  a  constant  chosen  such  that 


/  fp(P/m»n^dP 


This  function  is  often  quite  useful  in  representing 
probabilities  since  it  can  take  on  a  rich  variety  of 
forms  over  the  interval  depending  upon  the  choice  of 
the  two  parameters.  These  forms  can  be  as  diverse  as 
a  U-shape,  a  horizontal  line,  or  a  bell  shape,  and  can 
be  symmetric  or  skewed  toward  either  end  of  the  range 
The  mean  of  the  function  is  given  by 


—  rv\ 


mV  a' 


This  notation  is  read  "the  beta  function  of  p  given 
m '  and  n ' . " 
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and  the  variance  by 

£  -P('-P’)  ■^7T„~T 

Also  the  maximum  of  the  function  occurs  at 
^  w'-l 


w 


'+n,-2. 


except  for  the  cases  n 1 ^ 1  and/or  m ’ ^  1  in  Which  cases 

the  maximum  occurs  at  the  points  p=0  and/or  p=l.  The 

next  page  shows  plots  of  the  function  for  several  values 
4 

of  m‘  and  n' . 

It  can  be  shown5  that 


wl  ere 


,  _  MUhi 

^n>~  ((«'♦«  j 

Rx+0  -  x  Rx) 


(B-7) 


(B-fe) 

Multiplying  the  beta  function  probability  dis¬ 
tribution  by  the  population  size,  K,  gives  the  popula¬ 
tion  density  function 


NW-ni  (P/r*'X). 


’V.Q 

Then  N(p)dp  is  the  number  of  people  with  probability 
between  p  and  p+pd  and  the  total  population  is  just 


/ 


N(p)dp  =  K  =  the  total  population. 


For  a  more  detailed  discussion  of  the  beta  function 
see  Howard  Raiffa  and  Robert  Schlaiffer,  Introduction  to 
Statistical  Decision  Theory  (New  York:  McGraw-Hill, 19657. 

5  Ibid. 
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Now  with  this  function  the  expected  number  of 
people  exposed  at  least  once  through  t  time  periods 
follows  from  equation  (B-5)  above. 


E(Ct’)«jf'  N^O-O-^Up 


Using  the  relationships  between  the  complete 
beta  function,  the  gamma  function,  and  factorials 
(equations  (B-7)  and  (B-8))  we  simplify  the  result 


E(CtVk[>- 


f(m')  f(A')  rCm'+tt'+t) 


(yy+i -*)(*• <V 
(vn'+rvVt-iKw'+n'+t  -2^ 


(B-9) 


This  formula  provides  the  expected  cumulation 
through  any  number  of  periods,  if  the  values  of  n*  and 


m*  are  known. 

Typically  our  data  will  consist  of  measurements 
or  estimates  of  the  average  audience  (C^)  and  the  two- 
period  cumulation  (C2) .  Given  these  values  we  can 
solve  for  the  two  parameters  of  the  beta  function. 

From  (B-9)  with  t  ■  1  and  t  -  2 

E(c)  (t-l) 

\  -ty  ft  V  1 

*  UvV+iYmW)  j  lW). 

Since  our  purpose  is  to  equate  these  expected  values 
with  the  measured  or  estimated  values 

E (C^)  =  CL 

e(c2)  =  c2 

and  therefore 

p  -  V  [\ 
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Solving  these  two  equations  for  the  values  of  n* 
in  terms  of  and  gives 


(K-0(C,-O 


and  m' 


(B-10) 


and 


<V  .* 


C,  _  c.(c,-cj  _ 

<-c.  KCj-^KC,*^ 


(B-U). 


Equations  (B-10)  and(B-ll)  enable  the  calculation  of  the 
values  of  the  parameters  from  C ^  and  C2.  However  they 
also  imply  certain  limitations  on  the  values  of  and 
which  can  be  modeled  by  the  distribution.  Since  na  must 
be  greater  than  zero  (in  order  that  the  integrals  converge) 
it  follows  from  equation  (B-10)  that 


given  K  >  C^,  i.e. ,  the  total  propulation  greater 
than  the  average  audience  and 
c2>  C,,i.e.,  the  two-period  accumulation 
greater  than  the  average  audience, 
then  the  denominator  of  equation  (B-10)  must  be  negative 
if  n'  is  to  be  greater  than  zero. 


2 

+  C1 


4 


0 


or 


£2  <  2CJL  _ 

K  K  k2 


(B-12) . 


In  words,  this  equation  implies  that  the  mean 
proportion  of  the  population  exposed  through  two  periods 
must  be  less  than  twice  the  average  audience  proportion. 
The  equality  of  the  left  and  right  hand  sides  of  equation 
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(B— 12)  would  imply  a  random  process  with  each  individual 
having  the  same  probability  of  exposure,  i.e.,  the  entire 
population  lumped  at  the  same  probability.  Equation  (B-12) 
implies  that  this  condition  is  the  limiting  form  which 
can  be  modeled  by  the  beta  density  function. 

The  variance  of  these  expected  cumulations  is 
found  from  equation  (B— 6 ) 


Var(Ct)=^'N(p^[i-0-p'ik](|'p)  dp 


-K 


(B-13) 


We  also  calculate  the  mean  and  variance  of  theTfn. 

r 

Using  the  same  population  density  function  the  expected 
number  of  people  exposed  on  exactly  r  of  n  days  is  (from 
equation  (B— 3)): 

= /Vc(f>/->,)0)pr(|-pr% 


'o  '  •  9 
K 


P  (\-p) 


<Ap 
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This  can  be  "simplified"  by  use  of  the  relations  in  equations 
(B-7)  and  (B-8) : 

C— / '  n.\  _  M  v 

E(Tff'  =  WVv«VV+n^')T«-) 

~(w  •**  (m+h') 

(B-14). 


The  variance  of  TT™  is  calculated  from  equation  (B-4), 

Var«)«^N(f^)p'(l-p)  ['-(?)  ^(I-Pf  ]<»p 

n'+Zv{-2r) 


(B-15) 


Summary  of  the  One  Beta  Function  Model 

The  population  is  distributed  as  a  function  of  the 
probability  according  to 
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where  K  equals  the  total  population,  and  m*  and  n'  are 
determined  by  fitting  known  values  of  and  C^.  These 
values  must  satisfy  the  following  conditions 


and 


K  >C2>  C 


1 


The  Three  Beta  Function  Model 


(B-16) 


(B-17). 


The  population  is  considered  to  consist  of  three 
croups,  the  frequent,  infrequent,  and  casual  audience 
of  a  vehicle.  For  the  case  of  a  magazine,  the  frequent 
readers  might  be  the  subscribers  and  their  families, 
the  casual  readers  might  be  the  newsstand  buyers  and 
passalong  readers,  and  the  infrequent  readers  the  rest 
of  the  population.  It  will  be  typical  of  this  approach 
that  the  values  of  some  parameters  will  be  residual, 
as  well  as  the  estimate  of  the  number  of  people  in  one  of 
the  three  groups.  This  is  because  the  best  empirical 
values  will  usually  apply  to  the  total  population.  The 
following  equations  summarise  the  obvious  relationships 
between  the  parameters  and  population  values  for  each 
of  the  three  groups  and  the  totals  for  the  entire  popu¬ 
lation.  (The  Roman  superscripts  on  the  C's  indicate 
the  group  to  which  they  refer.  Those  without  Roman 
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superscripts  refer  to  the  total  population.)  First  the 
contributions  to  the  cumulation  from  each  of  the  distri¬ 
bution  must  sum  to  the  total  for  the  Whole  population. 
Therefore 

Cj-  +  cj1  +  C*11  =  Cx  (B-18) 

•  •  •  •  «  • 

C2  +  C2X  +  C2X1  =  °2  (B-19) 

Also  the  population  values  of  the  three  groups  must 
sum  to  the  total,  i.e 

*  •  •  «  «  • 

K1  +  K11  +  K111  =  K  (B— 20) . 

In  general  the  values  of  C^,  C2,  and  K  will  be  the  best 
estimates.  Therefore  for  each  equation  it  suffices  to 
specify  only  two  of  the  values  for  the  subgroups.  In  the 
example  below  the  number  of  infrequent  readers  is 
calculated  as  a  residual.  Using  equation  (B-20) 

K1  =  K  -  Kxi  -  K3"11 

or  in  words, 

(number  of  \  I  total  \  /newsstands  /subscriber* 
infrequent)  =  I  population]  -l  buyers  j  -land  theiz  I 
readers  /  \  /  \  /  ^families  j 

The  problem  of  Chapter  III  is  just  to  make  these  estimates 
of  the  best  values  for  the  K's,  C^'s  and  C^'s  for  the  various 
media.  Of  course  the  restrictions  of  (B-6)  and  (B-7) 
apply  to  each  set  of  [k, 0^,02], 


The  audience  data  for  the  Cincinnati  vehicles  of 
1947-1948  is  not  sufficient  to  adequately  test  the  beta 
function  model.  For  none  of  the  vehicles  do  we  have 
any  data  beyond  a  good  estimate  of  the  average  audience; 
for  the  two-period  cumulations  we  have  made  rough  estimates 
on  the  basis  of  these  average  audiences  and  educated 
guesses  about  distributions  and  their  mean  exposure 
probabilities  (see  Chapter  V  for  details).  Therefore 
we  must  turn  to  data  from  other  vehicles  and  media  to 
assess  the  usefulness  of  the  beta  function  model. 


The  Fit  of  the  Beta  Function  Model  to  Empirical  Data 


As  we  have  pointed  out  in  the  Introduction,  the 
beta  function  model  appears  to  have  been  first  suggested 

0 

by  Hyett  in  a  seminar  at  the  London  School  of  Economics. 
Hyett  proposed  using  one  beta  function  for  the  total 
population  and  is  reported  (by  Metheringham)  to  have 
tested  the  model  on  empirical  data  with  favorable  re¬ 
sults.  Metheringham  extended  Hyett 's  calculations  to 

duplication  between  vehicles  in  an  article  in  the  Journal 
_  ...  _  .7 


Research. 


GoP.  Hyett.  Paper  read  to  the  Statistics  Seminar, 
London  School  of  Economics,  February,  1958. 

7 

Richard  A.  Metheringham  "Measuring  the  Net  Cumula¬ 
tive  Coverage  of  a  Print  Campaign, "  Journal  of  Advertising 
Research,  Voi.  4,  No.  4  (December,  1964),  pp.  23-28. 
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These  previous  attempts  have  taken  a  single  beta 
function  and  used  the  average  and  the  two-period  cumulative 
audience  to  fit  the  two  parameters  of  the  beta  function. 
However,  this  method  in  general  does  not  provide  the 
best  estimate  of  the  two  parameters  of  the  beta  function, 
especially  if  additional  cumulation  data  are  available. 

In  this  case,  we  could  choose  the  values  of  the  parameters 
which  maximize  the  probability  of  getting  the  observed 
cumulation  curve  (maximum  likelihood  estimation) ,  or 
combine  the  likelihood  function  with  a  prior  estimate 
of  the  parameters  to  get  a  Bayesian  estimate.  A  third 
possibility  would  be  to  choose  the  parameters  so  as  to 
minimize  the  summed,  squared  deviations  between  the  ob¬ 
served  cumulation  curve,  and  that  implied  by  the  beta 
function.  Note  however  that  the  beta  function  distribu¬ 
tion  of  probabilities  implies  an  infinite  set  of  frequency 
and  cumulation  curves.  Thus  one  least  squares  procedure 
would  attempt  to  equate  the  average  or  "expected"  cumula¬ 
tion  curve  with  the  observed  curve.  This  is  equivalent 
to  the  procedure  outlined  above  (p.  474)  When  only  the 
first  two  cumulation  values  are  available  (i.e.,  fitting 
and  ).  Since  these  values  are  generally  the  only 
values  available  (at  best)  to  the  researcher  attempting 
to  simulate  a  mass  media  system,  we  have  chosen  this 
estimation  procedure  and  shall  use  it  in  the  following 
exploration  of  the  fit  of  model  to  empirical  data. 
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There  are  some  complications  to  this  method  of 
estimation.  First,  let  us  note  that  it  seems  unreasonable 
that  every  point  in  the  cumulation  curve  should  carry 
equal  weight.  For  example,  the  cumulative  two-period  audi¬ 
ence  (C2)  includes  the  audience  from  (the  average  audience) . 
In  no  sense  can  we  imagine  that  the  second  point  (C2)  on  the 
curve  is  independent  on  the  first  point.  This  holds  true 
for  all  the  rest  of  the  points  on  the  curve.  For  this 
reason,  it  would  seem  reasonable  to  assign  decreasing 
weight  to  the  values  of  the  observations  as  the  time 
period  increases.  This  would  be  the  reasonable  way  to 
allot  weight  to  the  empirical  data;  however,  it  does  not 
necessarily  produce  the  closest  fit  between  the  derived 
and  the  empirical  points,  and  that  closest  fit  is  obviously 
the  least  squares  solution  for  the  two  parameters. 

A  third  reason  for  using  only  and  C 2  in  fitting 
the  curves  is  that  the  derived  curves  are  very  complicated 
functions  of  the  parameters,  and  the  mathematics  involved 
in  a  least  squares  estimation  of  the  parameters  from  all 
the  points  on  the  curves  is  extremely  difficult  and  probably 
impossible,  except  in  some  iterative  technique  on  a  computer. 
For  all  these  reasons,  then,  we  have  followed  the  conventional 
procedure  of  fitting  the  empirical  curves  from  values  of 
and  C2  even  though  this  does  not  necessarily  provide  the 
best  fit  to  the  data  over  the  entire  length  of  the  curve. 

In  general,  it  will  not  be  the  best  possible  fit  to  the 
data  over  the  entire  length  of  the  curve. 


******* 
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In  the  single  beta  function  model  there  are  two 
parameters  to  be  estimated  from  the  data.  However,  in 
the  three  beta  function  model,  we  have  a  total  of  eight 
parameters  which  must  be  estimated  from  the  data.  In 
this  case,  for  each  distribution  we  must  estimate  and 
C2  and  for  two  of  the  distributions  we  must  estimate  the 
proportions  of  the  population  belonging  to  the  distribu¬ 
tions.  The  third  proportion,  of  course,  is  the  proportion 
needed  to  make  the  entire  population  sum  to  100  per  cent. 
Given  the  difficulty  in  getting  empirical  values  for  the 
one  beta  function  model,  one  might  imagine  that  it  is 
doubly  difficult  to  get  empirical  values  for  the  eight 
parameters  of  the  three  beta  function  model.  (For  an 
example  of  the  kinds  of  reasoning  used  in  generating  these 
parameters  and  the  relationships  between  thesu  parameters, 
see  Chapter  V. )  Because  of  the  imprecision  in  the  estimates 
of  these  parameters,  we  have  made  some  attempt  to  examine 
the  problem  of  what  difference  the  parameters  maJce  in 
values  of  cumulation  and  frequency  of  exposure.  To  do 
this,  we  have  taken  as  given  two  values,  the  average 
audience  and  the  two-period  cumulative  audience,  for  the 
entire  population.  This  completely  specifies  the  parameters 
for  the  one  beta  function  model,  but  it  leaves  six  of  the 
eight  parameters  free  in  the  three  beta  function  model. 

The  question  we  ask  is:  given  that  the  cumulation  and 
average  audience  are  fixed  for  the  first  two  time  periods 
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for  the  entire  population,  what  difference  does  juggling 
the  other  parameters  make  in  the  values  of  cumulation  and 
frequency  of  exposure?  To  answer  this  we  have  taken  three 
cases  for  exploration.  This  is  actually  only  an  explora¬ 
tion  because  it  would  be  very  difficult  to  examine  all 
of  the  possibilities,  the  infinite  number  of  combinations 
of  the  six  remaining  free  parameters,  even  though  they 
are  somewhat  constrained  by  each  other.  In  two  of  the  three 
explorations  we  have  a  great  deal  of  empirical  data  avail¬ 
able  with  which  to  compare  the  one  beta  function  and  three 
beta  function  models.  In  the  last  case  we  have  taken  hypo¬ 
thetical  values  for  the  average  audience  and  the  two-period 
combination  which  might  be  appropriate  to  a  newspaper  and 
explored  the  consequences  of  various  combinations  of  the 
remaining  six  parameters  in  the  three  beta  function  model. 

Instead  of  an  exhaustive  exploration,  what  we  have 
done  is  choose  values  for  the  parameters  which  seem  plausible 
in  the  light  of  data  or  intuition  about  the  distribution 
of  infrequent  and  quite  frequent  readers  in  the  popula¬ 
tion,  looking  at  the  effects  that  this  variation  in  parameters 
has  upon  the  frequencies  and  accumulations  generated.  In  j 

analyzing  this  data  we  may  keep  two  points  in  mind.  The 
first  of  these  is  that  for  each  distribution,  the  inequality 
constraint  relating  the  maximum  value  of  the  two-period 
cumulation  to  the  average  audience  limits  the  possible 
values  of  that  two-period  cumulation,  given  the  average 
audience.  The  second  point  to  remember  is  that  the  single 
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beta  function  model  is  equivalent  to  a  three  beta  function 
model  when  the  average  and  two-period  cumulative  audience 
as  proportions  of  the  population  of  each  distribution  are 
identical  with  these  proportions  as  found  in  the  popula¬ 
tion  as  a  whole.  Keeping  these  facts  in  mind,  let  us 
look  at  the  first  exploration  of  the  parameters. 

For  the  first  case  there  is  a  great  deal  of  data 
available.  This  is  a  study  of  the  cumulative  and  repeat 
audiences  of  Better  Homes  and  Gardens  over  twelve  issues, 
beginning  in  October,  1954  and  continuing  through  September, 

O 

1955.  For  Better  Homes  and  Gardens  we  chose  the  following 
values  for  the  parameters;  in  each  trial  we  set  the  propor¬ 
tion  of  the  population  in  the  low  distribution  at  20  percent 
with  an  average  probability  of  exposure  of  .01  and  a  pro¬ 
portional  two-period  cumulation  of  .019.  The  values  for- 
the  high  distribution  range  from  two-and-a-half  to  ten 
percent  of  the  population  in  the  high  distribution  with 
an  average  exposure  probability  ranging  from  .80  to  .95 
and  the  proportion  of  the  population  in  the  two-period 
cumulative  audience  ranging  from  .93  to  .99.  The  values 
for  the  middle  distribution  are  found  by  subtraction 
from  the  overall  average  exposure  probability,  which  is 
.125,  and  the  overall  average  proportion  in  the  two- 
period  cumulative  audience,  which  is  .182.  With  those 


This  data  is  taken  from  Alfred  Politz  Research,  Inc., 
A  Twelve  Months 1  Study  of  Better  Homes  and  Gardens  Readers, 
Des  Moines,  Iowa:  Meredith  Publishing  Company,  1956. 


ranges  of  values,  we  look  at  several  of  the  expected 
cumulation  and  frequency  statistics  derived  from  the 
resulting  distributions  (Fig.  B-3) .  It  is  obvious 
from  the  graph  that  one  of  the  cumulative  curves  deviates 
rather  markedly  from  the  empirical  cumulation  curve  and 
also  the  frequency  curve  in  this  case  is  quite  different 
from  the  empirical  curve.  Both  the  single  beta  function 
curve  and  two  of  the  three  beta  function  curves  fit  the 
empirical  data  rather  well.  The  empirical  twelve-period 
cumulative  audience  is  .357.  The  one  distribution  model 
gives  a  predicted  twelve-period  cumulative  proportion  of 
.342,  somewhat  lower  than  the  empirical  value.  For  the 
three  beta  function  case,  however,  the  values  generated 
encompass  the  empirical  value;  they  range  from  a  minimum 
proportion  of  .345  to  an  upper  limit  of  .483.  Thus,  the 
three  beta  function  model  can  cover  the  empirical  values; 
after  twelve  issues  it  has  a  variation  of  thirteen  and  ope- 
half  per  cent  of  the  population  or,  at  most,  it  over¬ 
estimates  the  twelve-period  cumulative  audience  by  about 
30  percent. 

Next  we  look  at  several  of  the  frequency  values 
Which  seem  to  be  important.  The  usual  frequency  curve 
shows  a  large  number  of  people  exposed  none  out  of  N 
times,  somewhat  fewer  exposed  om'e,  and  in  general,  the 
curve  declines  towards  exposure  about  half  of  the  time, 
then  increases  somewhat  for  exposure  to  nearly  all  of 
the  issues,  due  to  the  subscribers  in  the  population. 
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Figure  B-3.  Cumulation  and  Frequency  Curves  For 
Various  Values  of  the  Parameters  For  Better  Homes 
And  Gardens,  1958 
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If  we  look  at  the  frequency  curve  corresponding 
to  the  most  deviant  cumulation  curve  we  see  that  the  fre¬ 
quency  curve  shows  many  more  people  with  very  infrequent 
exposure,  much  fewer  than  average  numbers  of  people  with 
moderate  exposure  and  then  a  slight  growth  in  the  number 
of  people  with  high  exposure.  Thus  the  effect  which 
gives  such  a  high  rate  of  cumulation  is  to  increase  the 
number  of  people  at  each  end  of  the  frequency  curve  and 
decrease  the  proportion  of  cas  3  in  the  middle  .ranges  of 
the  curve. 

The  second  magazine  for  which  we  have  empirical 
data  and  have  generated  a  number  of  cumulation  and  fre¬ 
quency  curve.*  s  a  Swedish  magazine,  Hemmets  Veckotidninq , 

g 

which  was  studied  by  Schyberger  in  Malmo  during  1962. 

For  H.  V. ,  the  average  audience  was  33.4  percent  of-  the 
population  with  an  average  two-period  cumulation  of  40.5 
percent  of  the  population.  Thirteen  different  curves 
were  fitted  with  the  following  values  of  the  parameters; 
the  proportion  of  the  population  in  the  low  distribution 
varied  from  ten  to  forty  percent  while  the  population 
proportion  for  the  high  distribution  varied  from  ten  to 
twenty-five  percent.  The  average  probability  for  the  low 
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The  data  on  this  magazine  and  on  several  other  Swedish 
magazines  is  reported  by  Bo  W:son  Schyberger  in  Methods  of 
Readership  Research,  pp. 93-117.  We  note  here  that  the  general 
method  of  averaging  over  all  the  issues  studied  in  order  to 
get  the  average  audience  and  the  average  cumulation  figures 
was  not  used  by  Schyberger  in  his  reporting.  Therefore,  his 
values  of  two-period  cumulation  are  not  the  values  which  one 


Percent  of  Population  Exposed 


distribution  varied  from  .010  to  .100  with  a  cumulation  vary 
ing  from  .015  to  .170.  For  the  high  distribution  the  average 
probability  varied  from  .800  to  .900  with  cumulation  varying 
from  .900  to  .980.  The  particular  cumulation  curve  which 
comes  closest  to  the  empirical  curve  was  generated  with 
forty  j|prcent  of  the  population  in  the  low  distribution 
with  a  mean  probability  of  .100  and  a  two-period  cumula¬ 
tion  of  .170.  The  proportion  of  the  population  at  the 
high  distribution  was  ten  percent  with  a  mean  probability 
of  .900  and  two-period  cumulation  of  .970.  This  left  fifty 
percent  of  the  population  in  the  middle  distribution  with 
a  mean  probability  of  .408  and  an  average  two-period  cumu¬ 
lation  of  .480.  Looking  at  the  graph  in  Figure  B-4  we 
see  that  the  best  three  beta  function  curve  underestimates 
the  twelve-issue  cumulative  audience  by  about  ten  percent 
when  the  curves  are  fitted  with  the  values  of  the  average 
audience  and  the  two-period  cumulation.  The  single  beta 
function  curve  and  other  three  beta  function  curves  fall 
even  further  below  the  empirical  cumulation  curve.  It 
is  not  clear  why  this  is  so.  It  may  be  that  there  are 
other  problems  with  Schyberger ' s  data  than  the  problem 


would  generally  use  in  a  study  like  this  and  m  fact, 
because  of  a  secular  trend  in  readership  during  the  time 
of  his  study,  his  values  are  consistently  too  low.  We 
have  taken  his  values  and  averaged  them  in  reporting 
them  in  the  present  study. 
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with  the  averaging  in  order  to  get  the  cumulation.  It  may 
also  be  that  cumulation  is  simply  quite  different  for  data 
of  Swedish  magazines  in  Malmo  and  that  the  model  does  not 
fit.  Another  possibility,  of  course,  is  that  the  model 
will  fit  but  that  we  simply  have  not  been  able  to  find 
the  proper  values  of  the  parameters  to  generate  the  maxi¬ 
mum  cumulation  given  the  two  fitted  points. 

The  final  exploration  in  the  effects  of  the  various 
parameters  was  performed  on  hypothetical  data  which  might 
be  appropriate  to  newspaper  cumulation  values.  Here  the 
average  audience  for  a  hypothetical  newspaper  was  taken 
as  fifty  percent  of  the  population  with  a  two-period 
cumulation  of  sixty-five  percent  of  the  population.  The 
proportion  of  the  population  in  the  low  distribution  var¬ 
ied  from  ten  to  forty  percent  with  a  mean  probability  of 
exposure  from  .010  to  .150  and  two- period  cumulation  from 
.015  to  .250.  For  the  high  distribution,  the  population 
proportion  ranged  from  ten  to  thirty  percent,  with  a  mean 
probability  of  exposure  varying  from  .800  to  .  ?00  and  a 
proportional  two-period  cumulation  from  .900  to  .970.  For 
these  very  high  values  we  have  generated  thirteen  differ¬ 
ent  cumulation  curves  and  frequency  curves,  the  extremes 
of  which  are  shown  in  the  graph  of  Figure  B-5.  The  graphs 
show  a  significant  variation  in  the  extremes  of  the  curves 
for  the  thirteen-period  cumulation.  The  smallest  value 
which  we  found  exposed  at  least  once  was  about  eighty 
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percent  of  the  population,  while  the  maximum  exposure 
was  approximately  ninety  percent  of  the  population.  Note 
that  the  range  is  about  ten  percent  of  the  population  or 
about  twelve  percent  possible  error  in  the  thirteen-period 
cumulation. 

If  we  look  at  the  three  cases  together,  we  seem 
justified  in  making  the  following  generalization:  the 
greatest  absolute  and  proportional  errors  occur,  or  are 
likely  to  occur,  in  those  distributions  for  which  the 
average  audience  and  two-period  cumulation  are  small. 

This  is  the  case  for  the  Better  Homes  and  Gardens  cumu¬ 
lation  curve  in  which  we  see  that  the  range  of  the  cumu¬ 
lation  curve  can  be  quite  large  and  seems  to  be  steadily 
increasing  for  the  one  extreme  curve  as  the  time  periods 
progress,  at  least  through  twelve  time  periods.  On  the 
other  hand,  for  the  data  where  the  average  audience  and 
cumulation  are  larger,  the  errors  or  variations  in  the 
curves  are  smaller  both  absolutely  and  proportionately, 
and  the  rate  of  growth  of  error  is  smaller.  In  general, 
we  might  make  a  rough  estimate  that  for  the  average 
newspaper  vehicle  our  estimate  of  the  cumulative  number 
of  people  exposed  may  be  in  erVor  by  as  much  as  ten  per¬ 
cent  after  thirteen  issues.^  For  a  radio  vehicle  which 


\ 
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°There  is,  however,  an  asymptote  to  the  cumulation 
curve,  and  therefore  after  many  time  periods,  the  cumul¬ 
ation  error  must  diminish.  This  effect  probably  happens 
earlier  for  those  vehicles  with  larger  average  audiences. 
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has  a  very  small  average  audience,  our  estimate  of  the 
cumulative  audience  may  be  in  error  by  as  much  as  ten  to 
fifteen  percent,  after  twelve  or  thirteen  issues.  This 
analysis  does  seem  to  show  however  that  for  many  vehicles 
the  proper  choice  of  parameters  (were  they  known)  would 
produce  cumulation  and  frequency  curves  closely  approxi¬ 
mating  the  empirical  curves.  Only  in  the  case  of  the 
Swedish  magazines  (discussed  in  more  detail  below)  do  we 
seem  to  have  some  regular  deviation  from  the  empirical 


curves . 
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The  Fit  of  the  One-and  Three-Beta  Function  Models  for  Thirty-Pro 
Vehicles 

In  order  to  explore  more  fully  the  feasibility  of  using 
the  one-or  three-bet3  function  models,  we  have  fitted  the  curves 
to  each  of  thirty-two  vehicles  and  in  the  table  below  we  present 
for  comparison  the  largest  real  cumulation  value  which  has  been 
measured  for  these  vehicles  and  the  comparable  cumulation  value 
derived  from  the  one-and  three-beta  function  models  by  fitting 
the  average  audience  and  the  two-period  cumulation. 

In  examining  these  figures  we  recall  once  more  that  the 
cumulation  values  generated  by  the  one-beta  function  model  are 
fixed  once  the  average  audience  and  the  two-period  cumulation 
are  known.  However,  for  the  three-beta  function  model  these 
two  values  do  not  fix  the  cumulation  curve  and  it  appears  from 
our  previous  explorations  that  the  values  for  twelve  or  thirteen 
time  periods  may  vary  by  as  much  as  ten  or  fifteen  percent  of 
the  population  (depending  on  the  choice  of  the  other  parameters) , 
cr  by  one  or  two  percent  times  the  number  of  time  periods 
beyond  the  second  for  which  the  values  are  calculated.  Thus, 
the  further  the  time  period  is  beyond  the  second  time  period, 
the  greater  the  variation  between  the  values  generated  by  the 
different  parameters  of  the  three-beta  function  model,  at  least 
up  to  thirteen  time  periods;  a  two  percent  error  at  time  period 
thirteen  may  be  considered  a  much  smaller  error  than  a  two  per¬ 
cent  error  at  time  period  four.  (For  example,  a  two  percent 
error  at  time  period  four  may  grow  to  a  ten  or  twelve  percent 
enor  at  time  period  thirteen.)  There  are  two  other  factors  to 
keep  in  mind  in  examining  the  table.  The  first  of  these  is  that 
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the  three-beta  function  model  cam  always  be  made  equal  to  the 
one-beta  function  model  simply  by  giving  each  of  the  distribu¬ 
tions  the  overall  average  audience  and  two-period  cumulation. 
Thus,  in  the  cases  where  the  one-beta  function  model  is  better 
than  a  particular  three-beta  function  model  we  must  remember 
that  the  three-beta  function  model  includes  the  possibility  of 
generating  exactly  the  same  values  as  the  one-beta  function 
model.  Finally,  the  values  of  the  parameters  in  the  three-beta 
function  model  were  chosen  on  an  intuitive  basis  by  the  re¬ 
searcher.  These  values  are  not  necessarily  the  best  choices 
and  some  experimentation  would  likely  produce  cumulation  closer 
to  the  empirical  calues,  especially  in  the  case  of  Swedish 
magazines . 

The  first  group  of  vehicles  are  fourteen  American  mass 
magazines  whose  audiences  were  studied  during  the  period  from 
1950  to  1961.  For  this  group  of  magazines  the  cumulation  was 
measured  from  four  to  thirteen  issues.  We  see  that  the  cumu¬ 
lations  derived  from  either  the  one-beta  function  or  the  three- 
beta  function  model  fit  these  data  quite  well.  The  errors  in 
the  one-beta  function  cumulation  are  generally  quite  small 
even  for  the  twelve  and  thirteen  time  period  cumulations  with, 
however,  one  systematic  error  in  that  the  model  generally 
understates  the  cumulation  by  a  very  small  amount.  The  three- 
beta  function  model  with  the  parameters  chosen  in  a  rather 
ad  hoc  fashion  by  the  researcher  does  reasonably  well  and  if 
we  recall  the  variation  possible  in  the  cumulation  with  a 
different  choice  of*  parameters  it  seems  clear  that  the  three- 
beta  function  model  can  be  made  to  fit  the  empirical  data 
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quite  well.  However,  if  the  empirical  data  did  not  include  the 
cumulations  for  the  three  distributions,  it  would  seen  that  the 
best  estimate  of  the  parameters  for  the  three-beta  function 
model  would  be  identical  with  the  parameters  for  the  one-beta 
function  model. 

The  second  group  of  vehicles  are  four  American  baby  mag¬ 
azines.  These  are  magazines  which  circulate  primarily  among 
the  population  of  pregnant  or  recently  pregnant  American  women 
and  thus  represent  a  somewhat  esoteric  type  of  vehicle.  For 
these  vehicles,  the  one-beta  function  model  does  not  do  well  at 
all  in  modeling  the  empirical  cumulation.  The  error  seams  to 
be  much  larger  for  those  magazines  which  have  a  very  large  em¬ 
pirical  cumulation.  The  three-beta  function  models  do  signifi¬ 
cantly  better  except  in  the  case  of  Baby  Talk  for  which  the 
three-beta  function  model  with  the  particular  parameters  chosen 
for  this  calculation  does  not  do  significantly  better  than  the 
one-beta  function  model.  We  conclude  from  this  group  of  vehicles, 
that  the  one-beta  function  model  is  not  rich  enough  to  model 
these  magazines'  audiences  but  that  very  likely  the  three-beta 
function  model  can  come  very  close  to  modeling  the  audiences. 

The  next  two  groups  of  vehicles  in  the  table  are  American 
radio  and  television  programs  whose  cumulations  were  measured 
in  the  1953  Politz  Study  of  Four  Media. ^  For  these  vehicles. 
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Alfred  Politz  Research,  Inc.,  A  Study  of  Four  Media;  Their 
Accumulative  and  Repeat  Audiences,  (New  York:  Time  Incorporated, 
1953) 


the  cumulation  has  been  measured  through  only  four  time  periods, 
but  for  that  cumulation  both  the  one-beta  function  and  three- 
beta  function  models  seem  sufficient.  One  must  note,  however, 
that  the  one-beta  function  and  three-beta  function  models  seem 
to  systematically  overestimate  the  cumulation  and  that  this 
overestimation  is  consistently  larger  on  the  part  of  the  three- 
beta  function  model.  It  would  seem  then  that  in  modeling  radio 
and  television  programs  with  no  knowledge  beyond  the  average 
audience  and  two-period  cumulation,  the  researcher  would  do 
better  using  the  three-beta  function  model  as  a  one-beta  function 
model  than  by  picking  ad  hoc  values  of  the  parameters. 

The  final  group  of  vehicles  consists  of  five  Swedish  mag¬ 
azines  which  were  studied  by  Schyberger  in  Malmo  in  1962.  For 
these  magazines  we  have  cumulation  values  through  twelve  issues. 
We  have  spoken  previously  about  the  need  to  adjust  Schyberger’ s 
figures  by  averaging  over  many  issues,  a  procedure  which  he  did 
not  use.  This  procedure  has  been  used  in  calculating  the  aver¬ 
age  audience  and  two-period  cumulation  for  these  magazines. 
Nevertheless,  the  cumulation  values  derived  from  either  the 
one-beta  function  or  the  three-beta  function  model  did  not  seem 
to  reproduce  well  the  cumulations  which  were  measured  empiri¬ 
cally;  the  derived  values  are  consistently  lower  than  the  ob¬ 
served  values  and  the  larger  the  observed  values,  the  more  in 
error  are  the  derived  values.  Even  for  Hemmets  Veckotidning 
for  which  we  explored  a  variety  of  parameters  for  the  three- 
beta  function  model  the  fit  is  not  good  for  any  of  the  derived 
curves.  Thus,  for  some  reason  the  beta  function  models  may  not 
fit  these  data  on  Swedish  magazines. 
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We  conclude,  then,  that  in  the  case  of  American 
mass  magazines  or  American  radio  and  television  programs, 
the  one  and  three-beta  function  models  seem  to  fit  the 
empirical  data  quite  well  with  the  three-beta  function 
model  offering  a  greater  possibility  of  very  closely 
fitting  the  empirical  cumulation  but,  however,  with  the 
disadvantage  that  the  eight  parameters  rather  than  the 
two  parameters  for  one-beta  function  model  must  be 
calculated.  For  the  case  of  American  newspapers  which, 
of  course,  are  quite  important  in  the  context  of  this 
simulation  we  have  no  data  on  cumulation  and  can  only 
point  out  that  the  three-beta  function  model  allows  a 
wide  range  of  cumulation  to  be  duplicated  and  that  for 
vehicles  with  very  large  average  audiences,  as  have  the 
newspapers  used  in  the  simulation  the  possibility  of 
sizeable  errors  in  cumulation  is  likely  to  be  considerably 
lower. 
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The  Prediction  of  the  Tvo-Pericd  Cumulation  from  the  Average 
Audience 

We  have  shown  that  in  most  cases  we  do  reasonably 
well  by  fitting  the  parameters  of  the  cumulation  curve 
using  the  empirical  values  of  the  average  audience  and  the 
two-period  cumulation.  We  have  also  suggested  (in  Chaprer'  V 
abS’ffe1)  that  it  seems  reasonable  to  expect  some  relationship 
between  them.  We  have  suggested  above  that  this  relation¬ 
ship  might  be  of  the  following  kind:  that  the  larger  the 
average  audience,  the  smaller  would  be  the  proportional 
increase  of  the  two-period  cumulation  over  the  average 
audience.  This  would  be  true  not  only  because  the  base 
of  the  proportion,  namely  the  average  audience,  is  larger, 
but  also  because  there  remains  a  smaller  proportion  of  the 
population  to  be  newly  exposed  when  t  second  issue  arrives. 
In  the  graph  below  we  have  plotted  the  two-period  relative 
accumulation  (the  ratio  of  the  two-period  cumulation  to 
the  average  audience)  against  the  average  audience  as  a 
proportion  of  the  population  for  thirty- two  vehicles, 
including  baby  magazines,  radio  and  television  shows, 

Swedish  magazines,  and  both  household  and  individual 
audiences  of  American  mass  magazines.  This  graph  shows 
clearly  a  very  strong  negative  linear  relationship  be¬ 
tween  the  relative  cumulation  and  the  proportion  of  the 
population  in  the  average  audience. 
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Although  it  seems  logical  to  expect  a  relationship 
such  as  the  one  found  in  the  graph  of  the  thirty- two  vehicles, 
we  must  at  this  point  sound  a  warning.  The  two-period  rela¬ 
tive  accumulation  contains  in  both  the  numerator  and  the 
denominator  the  term  C.  which  is  the  average  audience  Thus, 
even  if  the  amount  by  which  the  two-period  cumulation 
increases  over  the  average  audience  were  a  random  variable 
the  fact  that  we  add  to  this  number  the  average  audience 
and  then  divide  the  sum  by  the  average  audience  and  plot 
this  qur  cient  against  the  average  audience  would  give  us 
a  high  correlation  and  any  statistics  we  might  derive 
from  this  plot  would  be  grossly  misleading.  Therefore, 
in  running  our  correlations  we  have  correlated  the  pro¬ 
portion  of  the  population  in  the  average  audience  with 
the  proportional  increase  in  the  audience  from  the  average 
audience  to  the  two-period  cumulation.  Figure  B-7  below 
shows  a  plot  of  this  relationship.  From  this  graph  we 
see  that  the  relationship  is  nearly  linear  up  to  an 
average  audience  of  about  twenty-two  percent  of  the  popu¬ 
lation,  but  that  above  this  point  the  graph  is  not  necess¬ 
arily  linear  and  in  addition,  the  relationship  becomes 
quite  weak.  In  fact,  the  relationship  may  not  hold 
at  all  for  the  cases  of  the  Swedish  magazines. 

What  form  of  relationship  might  we  expect  to 
find  here?  For  the  first  graph  where  a  linear  relation¬ 
ship  seems  to  apply,  this  relationship,  can  be  written  in 
the rU«rieral  form  of  a  linear  equation  as  follows: 


Relative  Accumulation  =  +  b 

where  is  the  proportion  of  the  population  in  the  average 
audience,  C2  is  the  proportion  in  the  two-period  cumulation, 
and  a  and  b  are  constants  tc  be  determined.  If  we  then 
perform  t>3  operations  on  this  equation  which  take  the  im¬ 
plicit  value  of  the  average  audience  from  the  left-hand 
side  of  the  equation  and  leave  as  a  dependent  variable 
only  the  increase  in  the  total  audience  from  time  period 
one  to  time  period  two  as  a  function  of  the  average 
audience,  the  equation  then  becomes  a  quadratic  equation 
without  a  constant  term.  Thus,  multiplying  by  C^, 

C2  =  aCx  2  +  bC1 

and  subtracting  from  both  sides  of  the  equation, 

C2  -  CX  «  aC,2  +  (b-1)  CL 

or  2 

net  increase  =  aC^  +  b'C^* 


Therefore,  we  would  make  the  prediction  that  if  we  were 
to  run  a  regression  using  an  equation  of  this  type,  we 
would  find  that  the  coefficients  of  the  linear  and 
quadratic  terms  were  highly  significant  but  that  the 
constant  term  would  not  be  significant.  We  have,  in 
fact,  run  such  a  correlation  and  these  findings  happily 
are  obtained.  They  are  shown  in  Table  B-2  below. 
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Table  B-2.  Regression  Coefficients  and  Significance  Levels 


Quadratic 

Linear 

Term 

Term 

Constant 

<°i » 

Coefficient 

-.1117xl0~i 

.5869 

.1424X101 

Significance 

Level  (one-tailed) 

0.025>p>0.010 

,005>p>.00l 

0.25>p>C.l0 

The  coefficients  of  both  the  linear  and  the  quadratic  terms  are 
highly  significant,  but  the  constant  term  is  not.  Therefore,  we 
conclude  that  we  can  predict  the  net  increase  from  time  period 
one  to  time  period  two  in  the  proportion  exposed  from  the  equa¬ 
tion 

2 

Net  increase  «  1.424  +  .5869  -  .01117C^ 

with  a  multiple  correlation  coefficient  of  R,  _  *  .5851  and  a 

1.23 

standard  error  estimate  of  1.6429.  If  we  wish  to  predict  the 
proportional  two-period  cumulation  directly  from  the  equation  we 
find 

C  .  1.424  +  1.5869  C.  -  .01117  C.2. 

?.  11 

The  somewhat  misleading  multiple  correlation  coefficient  is 
12 

R  =  .9817  with  a  standard  error  of  estimate  of  1.6429. 

1.23 


12 

Recall  that  this  is  an  inflated  value. 


We  can  also  perform  the  regression,  forcing  the  constant 
term  to  be  zero  (recall  that  it  was  found  to  be  not  signifi¬ 
cantly  different  from  zero)  and  find  the  resulting  equation: 


C  =  1.726  C  -  .0142  C  2 
2  11 


(B-21) . 


For  this  ecnu.tion  R„  „  =  .9814  and  the  standard  error  of 

1.23 

estimate  is  1.6730. 

Equation  (B-21)  is  plotted  in  Fig.  B-6.  The  negative 
coefficient  of  the  squared  term  causes  the  required  downturn 
from  the  peak  of  the  curve  at  an  average  audience  of  about  26 
percent.  If  the  Swedish  data  points  were  deleted,  this  would 
probably  occur  at  a  somewhat  higher  value  and  the  entire 
curve  would  probably  be  shifted  slightly  upward.  At  any  rate, 
this  curve  does  seem  to  offer  a  good  prediction  of  the  two- 
period  cumulation  from  the  average  audience  for  American  mass 
magazines,  radio,  and  television  programs  for  values  of  the 
average  audience  from  about  8  percent  to  25  percent  of  the  pop 
ulation. 


Addition  of  "Subscribers "  to  the  Regression 

We  have  mentioned  one  other  logical  variable  fcr  a  regres 
sion  predicting  audience  cumulation;  namely,  the  proportion  of 
the  average  audience  that  are  subscribers  or  quite  regular 
readers  of  the  vehicle.  The  data  for  this  variable  were  much 
more  difficult  to  gather  than  the  simple  cumulation  values  and 
therefore,  we  have  estimated  it  from  either  the  proportion  of 
copies  going  to  subscribers,  or  from  other  data,  for  instance 
the  proportion  of  the  audience  exposed  nine,  ten,  eleven,  or 
twelve  times  out  of  twelve  for  a  vehicle.  For  the  twenty- twc 
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cases  where  the  proportion  of  the  audience  who  are  subscribers 
to  the  vehicle,  or  the  proportion  who  are  quite  frequent  users 
of  the  vehicle  is  known,  we  have  introduced  this  variable  into 
the  multiple  regression.  We  expect  to  find  that  the  cumulation 
is  a  positive  function  of  the  proportion  in  the  average  audi¬ 
ence,  a  negative  function  of  the  square  of  that  proportion, 
and  in  addition,  a  negative  function  of  the  proportion  of  the 
average  audience  who  are  "subscribers"  or  heavy  users  of  the 
vehicle.  The  results  of  the  regression  confirm  this  expecta¬ 
tion.  The  individual  coefficients  are  each  highly  significant 

and  each  bears  the  anticipated  sign.  The  multiple  correlation 

2 

coerficient  is  R  =  0.7724  (R  =  0.596;  p.0.008)  and 

1.234  1.234  * 

the  unbiased  estimate  of  the  standard  error  of  the  values 
estimated  from  the  regression  is  S  *  1.3313.  Thus,  the 

addition  of  the  estimate  of  the  proportion  of  “subscribers"  in 
the  average  audience  adds  significantly  to  our  ability  to  pre¬ 
dict  the  value  of  the  two-period  cumulation.  (The  inflated 

multiple  correlation  coefficient  for  this  case  is  R  = 

1.234 

.9875) . 

An  examination  of  the  residuals  shows  that  the  absolute 
errors  in  the  estimates  for  the  Swedish  magazine  range  from 
0.77  to  2.02  standard  errors,  (only  three  of  the  other  seven¬ 
teen  residuals  are  larger  than  .77  standard  errors  of  estimate). 
It  appears  that  the  cumulation  of  audiences  for  these  five 
magazines  differs  significantly  from  the  American  case  or  that 
the  data  roeasu  ament  was  not  comparable.  At  any  rate,  the 
removal  of  these  outliers  from  the  data  would  probably  somewhat 
increase  the  predictability  of  the  American  data. 


* 
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We  summarize  the  results  of  this  section  in  the 


following  points: 

1. 


Every  value  of  the  relative  accumulation 
which  we  have  found  (for  electronic  media 
and  magazines  in  the  United  States  and 
magazines  in  Sweden)  lies  within  the 
interval  1,16  to  1.73. 

We  can  make  predictions  of  the  proportion 
of  the  population  in  the  two-period  cumula¬ 
tive  audience  from  the  proportion  in  the 
average  audience  with  a  standard  error  of 
estimate  of  1.6429  from  the  equation 

C  *  1.4243  +  1.5896  C,  -  .01117c/. 

2 

We  can  make  a  better  prediction  using  some 
estimate  of  the  proporti'  of  the  average 
audience  who  are  "subscribers ,  with  a 
standard  error  of  estimate  of  1,3313,  from 

2 

C  =  -  2.1211  x  2.1327  C  -.02657  C,  - 
2  11 

.02458  (S/C  ). 

These  equations  fit  best  for  values  of  the 
average  audience  in  the  range  from  5  to  25  per¬ 
cent  of  the  population.  Beyond  this  point,  the 
functions  do  become  monotcnically  decreasing  as 
they  logically  should,  but  we  have  no  data  with 
which  to  check  the  fit. 

The  equations  seem  to  hold  well  for  electronic 
media  and  magazines  in  the  United  States,  They 
do  not  seem  to  hold  well  for  Swedish  magazines 
and  we  have  no  data  for  newspapers  anywhere.  In 
addition,  we  must  note  that  the  data  were 
gathered  in  the  period  from  1950  through  1962, 
and  although  we  have  no  reason  to  believe  that 
the  world  has  changed  in  this  respect  since 
then,  we  must  nevertheless  be  cautious  about 
extrapolating  these  data  to  other  time  period s- 
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Table  B-6.  The  Average  Audience  and  Relative  Cumulation  of  Pour 
Baby  Magazines  in  the  Population  of  Pregnant  Females 


Magazines 

Average 

Audience 

Two-Period 

Cumulation 

Relative 

Cumulation 

ci 

C2 

V 

c 

1 

"American  Baby" 

5.7?ba 

9.8%a 

1.72 

“Baby  Talk" 

13.6 

25.7 

1.33 

"My  Baby" 

14.9 

23.2 

1.56 

“Your  New  Baby" 

17.2 

26.8 

1.56 

a 

The  percentages  in  this  column  represent  proportions  exposed 
of  four  baby  magazines  in  the  population  of  pregnant  or  re¬ 
cently  pregnant  females  (n  =  3,685,000). 

Source:  The  data  are  taken  from  an  Audits  &  Surveys,  Inc., 
Target  Market  Study  of  Eight  Baby  Magazines,  August,  1955. 


APPENDIX  C 


INSTRUCTIONS  FOR  CONTENT  ANALYSIS  OF  CINCINNATI  NEWSPAPERS 


We  wish  to  locate  the  occurrences  in  the  mass 
media  of  certain  selected  themes  for  the  period  September 
15,  1947  through  March  1,  1948.  The  particular  themes 
are  described  below.  Obviously  a  single  news  item  may 
carry  more  than  one  theme,  e.g.  a  story  un  the  Palestine 
debate  in  the  U.N.  might  emphasize  both  the  peacekeeping 
rcle  of  the  U.b.  (Theme  1)  and  also  the  dissension  and 
dispute  among  the  Great  Powers  (Theme  2) ,  In  this  case 
both  themes  would  be  recorded  for  this  story. 

Certain  of  the  questionnaire  items  whose  responses 
are  especially  likely  to  be  affected  by  the  presence  of 
the  themes  in  the  mass  media  are  reproduced  below.  The 
item  numbers  following  each  theme  indicate  the  relevant 
questionnaire  item  or  items. 

Themes  relating  to  the  U.N. 

1.  U.N.  Peacekeeping.  Any  message  which  relates 
the  U.N.  to  peacekeeping,  promoting  harmony  among  nations, 
discouraging  aggression,  discussing  world  affairs  peace¬ 
fully,  settling  world  problems,  etc.  Includes  editorials 


urging  upon  the  U.N.  some  peacekeeping  action. 
Items  la,  2c,  5,  8,  9,  10,  11,  12. 


2.  Dissension  in  the  U.N.  Messages  discussing 
disagreements  in  the  U.N.  among  the  Great  Powers  (the 
US.,  England,  Russia),  bickering,  arguing,  name-calling 
lack  of  unity,  tco  much  talk  and  not  enough  action,  etc. 
Note  that  the  message  must  explicitly  mention  the  U.N. 
Items  lb,  2a,  8,  9,  10,  12a. 


3.  The  Veto  in  the  U.N.  Security  Council.  An 


message  mentioning 


about  the  veto  power  in  the 


U.N. ,  threats  to  veto  proposals,  number  of  Russian  vetos. 


etc. 


If  the  story  mentions  that  the  veto  cam  be  used 


only  in  the  Security  Council,  please  check  this  on  the 
code  sheet. 

If  the  message  somehow  implies  that  the  veto  power 
means  that  the  big  powers  must  agree  on  an  -  ion,  pr  .e 
check  this  on  the  code  sheet. 

Items  7,  12d. 


4.  The  U.N.  and  the  Rights  of  Man.  Any  message 
from  which  the  reader  can  infer  that  the  U.N.  is  concerned 
about  equal  rights  for  all  people  everywhere,  e.g.  Eleanor 
Roosevelt's  columns  on  the  work  of  the  U.N.  Human  Rights 
Commission. 

Item  6a. 


5.  The  U.N.  and  World  Trade.  Messages  which 


cause  the  reader  to  conclude  that  the  U.N.  has  a  respon¬ 
sibility  to  further  trade  between  nations.  Reports  of 
U.N.  debates  on  trade,  tariffs,  etc. 

Items  6c,  lib. 

6.  The  U.N.  and  World  Health  Conditions. 

Messages  which  imply  that  one  task  of  the  U.N.  is  to 
improve  health  conditions  in  different  parts  of  the  world. 
Famine  relief. 

Note  that  UNESCO  may  be  named  in  this  message  as 
the  agency  with  this  task.  If  the  U.N.  is  not  mentioned 
explicitly,  please  note  this  in  the  space  provided. 

Items  6b,  11a. 

7.  The  Slogan  of  the  U.N.  Information  campaign. 
"Peace  begins  with  the  United  Nations;  the  United  Nations 
begins  with  you."  Any  message  containing  this  slogan. 
Items  16,  17. 

8.  Cincinnati  Plan  Sponsors.  Any  message  con¬ 
cerning  the  American  Association  for  the  United  Nations 
or  the  U.N.  Association  of  Cincinnati  and/or  their 
efforts  to  promote  understanding  of  the  U.N. 

Items  14,  15 . 

9.  Explicit  Explanation  of  the  U.N.  message 

concerned  primarily  with  explaining  the  purposes  and/or 
workings  of  the  U.N. 

Items  5,  6. 
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10.  Satisfaction  with  the  U.N.  Any  message  con¬ 
veying  approval  of  the  U.N.  or  satisfaction  with  its 
progress  ?nd  policies.  Comments  generally  favorable 
toward  the  U.N. 

Items  8,  10,  11. 

11.  Dissatisfaction  with  the  U.N.  Messages  which 
convey  hostility  toward  or  displeasure  with  the  U.N.  or 
dissatisfaction  with  its  progress.  Comments  generally 
unfavorable  toward  the  U.N. 

Items  8,  10,  12. 

Themes  relating  to  other  international  issues 

12.  Control  of  the  Atomic  Bomb.  Messages  Which 
relate  to  control  of  the  bomb,  maintaining  the  U.S. 
secrets,  possible  use  of  the  bomb,  likelihood  of  Russia's 
developing  the  bomb,  etc.  Messages  which  further  the 
reader's  concern  and  interest  in  the  control  of  the  bomb. 
Ominous  warnings  and  predictions. 

Items  lc,  2b,  6d. 

13 .  Russian- American  Relations.  Any  message 
reminding  the  reader  of  growing  hostility  between  the  U.S. 
and  Russia,  e.g.  messages  concerning  Russian  foreign  policy 
moves  whichare  inimical  to  U.S.  interests;  or  concerning 
Russian  warnings  and  threats  (as  in  the  U.N.  exchanges 
between  Dulles  and  Gromyko) . 

Items  la.  lb,  2a,  3,  4,  8 


14.  Threats  to  Peace.  Messages  which  raise  the 
spector  of  war;  e.g.  messages  concerning  Russian  threats 
present  wars  as  in  Palestine  or  Greece.  Discussions  cf 
the  problems  of  keeping  peace  and  the  likelihood  of 
another  war.  Dire  predictions. 

Items  la,  2a,  3.  4. 
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QUESTIONNAIRE  ITEMS 


1.  "When  you  think  of  the  problems  facing  the  United 
States  now,  which  ones  come  to  your  mind  first?  What 
other  problems  do  you  think  of?" 


International  Problems 

March,  1348 

Sept. ,  1947 

a) 

Another  war,,  maintaining 
peace 

46% 

24% 

b) 

Relations  with  Russia 

29% 

16% 

c) 

Control  of  the  atomic  bomb 

1% 

4% 

d) 

United  Nations 

2% 

1% 

2.  "We'd  like  to  know  how  much  interest  the  public  takes 
in  a  number  of  questions.  For  example,  do  you  your¬ 
self  take  a  keen  interest,  only  a  mild  interest,  or 
practically  none  at  all  in  news  about:" 

Proportions  taking  a 
^H^en"  interest  ’ 
March,  1948  Sept. ,  1947 

a)  Our  relations  with  Russia  68%  54% 

b)  The  control  of  the  atomic 

bomb  56%  51% 

c)  The  United  Nations  34%  31% 

3.  "Do  you  expect  the. Uni ted  States  to  fight  in  another 

war  within  the  next  ten  years?" 


March,  1948 

Sept.,  1947 

Yes 

73% 

48% 

No 

15% 

38% 

Don't  Know 

12% 

14% 

rmrr 

vm 

4.  "Do  you  think  we  can  count  on  Russia  to  meet  us  half** 
way  in  working  out  problems  together?" 

March,  1948  Sept.,  1947 


Yes 

10% 

14% 

No 

80% 

74% 

Don't  Know 

10% 

12% 

16o4 
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Changes  in  information  about  the  United  States 

5.  "What  would  you  say  is  the  main  purpose  of  the  United 
Nations  organization?" 

March ,  1948  Sept.,  1947 

Proportion  unfamiliar  with 

the  United  Nations:  28%  30% 


(The  following  questions  were  asked  only  of  those  who  are 
familiar  wiih  the  United  Nations.) 

6.  "As  far  as  you  know,  is  the  job  of  the  United  Nations 
to  .  .  . " 


a) 

b) 


c) 

d) 


e) 


f) 


See  that  all  people  every¬ 
where  get  equal  rights  (yes) 
Improve  health  conditions  in 
different  parts  of  the  world 
(yes) 

Increase  trade  between  coun¬ 
tries  (yes) 

Deal  with  disarmament  and 
control  of  the  atomic  bomb 
(yes) 

Set  up  a  new  world  language 
to  be  used  in  all  countries 
(no) 

Work  out  peace  treaties  with 
Germany  and  Japan  (no) 


Proportions  answering 
each  item  correctly 


March,  1948  Sept., 

60% 

55% 

55% 

50% 

50% 

47% 

43% 

46% 

34% 

38% 

10% 

10% 

7.  "Have  you  heard  or  read  anything  at  out  the  veto  power  in 
the  United  Nations?" 


March,  1948  Sept. ,  1947 


a) 

Yes 

37% 

34% 

b) 

Proportion  who  could  explain 
the  working  of  the  veto  in 
terms  of  big  power  unanimity 

7% 

7% 

c)  Proportion  aware  that  the 
veto  power  could  only  be 
used  in  the  Security  Council 
and  not  in  the  General  Assem¬ 
bly  as  well 


8% 


7% 


Changes  in  opinion  about  the  United  Nations 


8.  "In  general,  are  you  satisfied  or  dissatisfied  with  the 
progress  Jiat  the  United  Nations  organization  has  jaade 
so  far? 


March,  1948  Sept.  1947 


Satisfied 

29% 

34% 

Dissatisfied 

33% 

28% 

Don't  Know 

10% 

8% 

~T2T 

9.  "Do  you  think  the  United  Nations  organization  will  suc¬ 
ceed  in  spite  of  the  disagreements  that  have  come  up 
among  England,  Russia  and  the  United  States,  or  do  you 
think  these  disagreements  are  so  serious  that  the  United 
Nations  organization  will  fail?" 


Proi 


entire 

March, 

1948 

sample 
Sept. , 
1947 

those 

March, 

1948 

queried 
Sept. , 
1947 

Will  Succeed 

35% 

43% 

48% 

62% 

Will  Fail 

25% 

17% 

34% 

24% 

Don't  Know 

12% 

10% 

-J1K 

18% 

ICRST 

14% 

166% 

10.  "Some  people  say  there  are  so  many  disagreements  in  the 
United  Nations,  that  we  would  be  better  off  to  get  to~ 
gether  with  other  countries  and  work  on  international 
problems  outside  the  U.N.” 

"Other  people  say  that  working  through  the  U.N.  is  the 
best  way  to  preserve  peace." 

"How  do  you  feel  about  this?  (Should  we  work  separately 


with  any  countries  that  want  to  join  us,  or  should  we 
work  mainly  through  the  United  Nations?)" 

Proportions  among  Proportions  among 

-  entire 
March, 
1948 

sample 

Sept. , 

1947 

those 

March, 

1948 

queried 
Sept. , 
1947 

Work  Separately 

12% 

10% 

17% 

14% 

Through  U.N. 

55% 

57% 

77% 

61% 

Don't  Know 

5% 

~7US 

3% 

-jus 

6% 

looi 

5% 

rm 

V  jr.  >  •*  *»/*  ;:  ,v  *o 


fessaiatoiaicarti 


<g»«rttv."nw>y  **  *  : 


11.  "What  would  you  say  are  some  of  the  good  things  the 
United  Nations  has  done  so  far?" 


Proportions 
among  entire 


Marc 


sample 
ch,  S< 


Sept. 


Proportions 
among  those 
queried 
March',  Sept . 


1948 

1947 

1948 

1947 

Handling  of  overseas 
food  and  relief 

9.% 

6% 

12% 

8% 

Creates  or  is  creating 
unity  between  nations, 
has  ended  isolationism, 
breaks  down  cultural  or 
trade  barriers 

8% 

7% 

11% 

10% 

Handling  of  specific 
issues :  Palestine , 
Greece,  atomic  bomb, 
UNESCO,  occupation 
forces,  etc. 

6% 

6% 

8% 

8% 

Helps  nations  to  get 
together  to  talk  over 
problems;  acts  as  a 
forum  or  sounding 
board 

5% 

5% 

7% 

8% 

Has  helped  small 
nations,  curbed  big 
ones 

3% 

2% 

4% 

3% 

Has  or  is  trying  to 
curb  Russian  power 

1% 

1% 

1% 

2% 

Is  creating  better  un¬ 
derstanding  between 
Russia  and  the  rest  of 
the  world 

* 

* 

* 

* 

Vague  general  approval 
of  U.N. 

8% 

7% 

10% 

9% 

4J.N.  has  done  nothing 
good  so  far 

11% 

9% 

14% 

12% 

Don't  know,  unable  to 
answer 

29% 

31% 

41% 

44% 

80%1 

74%1 

108S1 

104% 

These  columns  add  to  more  than  their  respective 
totals  of  72%,  70%,  100%  and  100%  because  a  few  respondents 
gave  more  than  one  response. 
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The  criticisms  of  the  United  Nations  were; 

12.  ''What  would  you  say  are  some  of  the  bad  things  about 
the  United  Nations  so  far?" 


a)  U.N.  is  failing,  too 
nrich  talk  and  not 
enough  action,  lack 
of  unity,  bickering. 


Proportions 
among  entire 
sample 

March, .Sept. , 
1948  1947 


Proportions 
among  those 

March*,  '  “Sept . , 
1948  1947 


arguing 

19% 

13% 

25% 

18% 

b)  Russian  power  is  not 
curbed  enough;  diffi¬ 
culties  with  Russia; 
Russia  should  be 
barred 

11% 

13% 

14% 

17% 

c)  Handling  of  specific 
issues :  not  enough 
relief  to  destitute 
countries,  Palestine, 
Greece,  failure  to  pro¬ 
ceed  with  disarmament 
and  control  of  atomic 
bomb 

6% 

3% 

8% 

4% 

d)  Big  nations  have  too 
much  power;  should  do 
away  with  veto;  make 
all  nations  equal 

3% 

2% 

4% 

3% 

e)  U.S.  doesn't  have 

enough  power;  is  being 
taken  advantage  of 

1% 

1% 

2% 

1% 

f)  Vague  general  disap¬ 
proval  of  U.N. 

5% 

3% 

7% 

4% 

g)  Everything  is  bad  about 
it;  should  never  have 
joined,  should  leave 
it  now 

1% 

1% 

2% 

2% 

h)  Nothing  bad  about  U.N.; 
just  growing  pains 

7% 

5% 

9% 

6% 

i)  Don't  know,  unable  to 
answer 

27% 

35% 

37% 

51% 

80%X 

76%X 

108%X 

106%1 

These  columns  add  to  more  them  their  respective 
totals  of  72%,  70%,  100%  and  100%  because  a  few  respondents 
gave  more  than  one  answer . 
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"Do  you  happen  to  belong  to  any  groups  or  organizations 
or  attend  any  meetings  where  they  talk  about  world  af¬ 
fairs  like  the  United  Nations?  (If  'No')  Is  there  any 
particular  reason  why  you  don't?" 


a) 

b) 

c) 


d) 

e) 


f) 

g) 

h) 

i) 


Proportions 

Proportions 

among 

entire 

among 

those 

sample 

queried 

Marchy 

Sept. , 

March, 

Sept. , 

1948 

1947 

1948 

1947 

Belong  or  partici¬ 
pate 

Haven't  been  asked 

12% 

11% 

16% 

15% 

to  join,  not  familiar 
with  any  groups  to 
join 

Time  s  Haven 1 1  got 
the  time,  too  busy, 
other  activities 

3% 

3% 

4% 

4% 

interfere 

13% 

16% 

19% 

23% 

Just  not  interested 

2% 

3% 

4% 

6% 

Personal  reasons :  age 
health,  newness  in 

» 

community,  etc. 

Dislike  belonging  to 

9% 

5% 

11% 

6% 

any  organization 
wouldn't  do  any  good. 

1% 

2% 

2% 

3% 

unnecessary,  none  of 
my  business 

1% 

1% 

1% 

1% 

Opposed  to  United 
Nations 

Don't  know  why;  no 

* 

* 

* 

* 

particular  reason 

31% 

29% 

43% 

42% 

~7J% 

~7UT 

W% 

"Do  you  happen  to  know  the  names  of  any  groups  or  organi 
zations  here  in  Cincinnati  that  are  trying  to  help  the 
United  Nations?" 


Proportions 
among  those 


Proportions 
among  entire 
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The  groups  named  scattered  over  all  possible  kinds. 
About  2%  cited  either  the  American  Association  of  the 
United  Nations  or  the  United  Nations  Association  of 
Cincinnati. 

15.  "In  your  opinion,  can  these  groups  and  organizations 
(groups  and  organizations  that  are  trying  to  help 
the  United  Nations)  do  anything  to  help  the  United 
Nations  in  a  practical  way?" 


Proportions 
among  entire 


Proportions 
among  those 


sample 

March,  ‘  Sept. , 

queried 
March,  Sept. , 

1948 

1947 

1948 

1947 

Yes 

46% 

40% 

65% 

57% 

No 

12% 

10% 

16% 

14% 

Don't  Know 

14% 

20% 

19% 

29% 

72"% 

"TVS 

HR)T 

loci 

16.  "During  the  last  six  months,  have  you: 

Proportions 
saylng~J,yesv 
among  entire 
sample^- 

a)  .  .  .  seen  anything  in 
the  newspaper  about  the 

United  Nations?  59% 

b)  ,  .  .  heard  any  radio 
news  programs  about  the 

United  Nations?  53% 

c)  .  .  .  heard  any  short 

radio  mentions  of  the 
United  Nations  between 
programs?  26% 

d)  .  .  .  seen  any  signs  or 

posters  about  the  United 
Nations?  21% 


Proportions 
saying  Yes* 
among  those 

queried 


83% 

74% 


36% 

29% 


^The  complementary  figures  for  those  shown  in  this 
column  are  the  differences  between  them  and  72%.  Thus  59% 
saw  newspaper  coverage  of  the  Jnited  Nations,  13%  did  not, 
and  28%  were  not  asked  the  question;  and  similarly  for  each 
item. 


2 

The  difference  between  each  of  these  figures  and 
100%  is  the  proportion  not  exposed  to  each  of  the  media  among 
those  who  were  familiar  with  the  United  Nations. 


16.  Continued 


e)  .  .  .  heard  anything 
about  the  United 
Nations  in  church? 

.f)  .  .  .  read  any  leaflets 
or  pamphlets  on  the 
United  Nations? 

g)  .  .  .  been  to  any  meet¬ 
ings  where  the  United 
Nations  was  talked 
about  or  discussed? 

h)  .  .  .  seen  or  heard 
anything  else  about  the 
United  Nations? 


12% 

10% 

9% 

21% 


17% 

14% 

12% 

29% 


The  main  sources  of  United  Nations'  coverage  mentioned 
under  the  "anything  else"  category  were  nationally 
circulating  periodicals,  newsreels  and  personal 
conversations . 

The  exact  items  mentioned  were: 


a)  Conversations ,  talking 
to  people 

b)  Nationally  circulating 
periodicals,  magazines 

c)  Newsreels,  movies 

d)  Through  school  children 

e)  St.  Xavier  University 
Pageant 

f )  Books 

g)  Full  length  radio  pro¬ 
grams  other  than  news 


Proportions 
among  entire 
sample 

Proportions 
among  those 
queried 

7% 

10% 

7% 

9% 

4% 

5% 

2% 

3% 

1% 

1% 

1% 

1% 

1% 

1% 

17.  "Have  you  ever  seen  or  heard  the  slogan:  'Peace  begins 
with  the  United  Nations;  the  United  Nations  begins  with 
you'?  (If  'Yes")  Do  you  recall  where  you  saw  or  heard 
it?" 
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